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Novel method for identifying antibacterial compounds 

The present invention relates to a method for identifying an antagonist or inhibitor 
of the expression of a gene encoding a polypeptide essential for bacterial growth 
or survival as well as for an antagonist or inhibitor of said polypeptide. The 
invention further relates to a method for improved antagonists or inhibitors. The 
invention also provides an antagonist or inhibitor of the activity of said 
polypeptide. The invention is further related to a method for producing a 
composition comprising said antagonist or inhibitor. Furthermore, the invention is 
related to the use of the polypeptide and the antagonist or inhibitor as well as to a 
method to identify a surrogate marker. 

Several documents are cited throughout the text of this specification. Each of the 
documents cited herein (including any manufacturer's specifications, instructions, 
etc.) are hereby incorporated by reference; however, there is no admission that 
any document cited is indeed prior art of the present invention. 

Since the beginning of the 1980s, a new trend has been observed in the 
industrialized countries. On the one hand, resistances to antibiotics have 
increased, which' make it difficult or even impossible to treat many of the disease- 
causing agents. On the other hand, new infectious diseases, which had been 
unknown up to now, arise, and old diseases return. For example, diphteria and 
tuberculosis are old epidemics and increasingly surmounting in many different 
parts of the world. Especially tuberculosis (TB). a chronic infectious disease that is 
generally caused by infection with Mycobacterium tuberculosis, is a disease of 
major concern. Each year, 8 to 10 million new cases of TB are described, and, 
causing more than three million deaths per year, TB is a major disease in 
developing countries as well as an increasing problem in developed areas of the 
world due to. for example, antibiotic resistance. 
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Additionally, M. bovis BCG vaccination has failed to protect against TB in several 
trials (WHO. Tech. Rep. Ser. (1980), 651. 1-15) for reasons that are not entirely 
clear (Fine, Tubercle 65 (1984). 137-153). It has been shown that the vaccine 
strain of M. bovis BCG only confers protection against the severe form of miliary 
tuberculosis in children (Fine, Lancet 346 (1995), 1339-1345). In contrast, its 
protective capacity against the most common form, pulmonary tuberculosis in 
adults, is low and highly variable (Colditz (1994), JAMA 271, 698). 

The causes for this new trend are complex: mainly, the increasing number of 
antibiotic applications in medicine and agriculture often combined with an 
improper und uncontrolled use, helps to establish resistant organisms and 
generate the threat of bacterial infections resistant to all available therapies. 
Conventional techniques of developing antibiotics, i.e. synthesis of candidate 
substances and screening for antibacterial substances, even though speeded up 
by several orders of magnitude by the use of combinatorial approaches in recent 
years (e.g. US5324483. US5545568). are still too inefficient as they involve 
multiple screening steps of hundreds or thousands of more or less randomly 
chosen substances for efficiency in combating various infectious agents. 
Therefore, it is a major concern to fight the growing number of bacterial infections 
due to an increased frequency of multiple antibiotic resistances and to improve 
the available antibacterial therapies. 

Thus, the technical problem underlying the present invention was to provide a 
method and means for the development of an additional effective antibacterial 
therapy of infected humans and animals that can be used for the. treatment of a 
broad spectrum of bacterial infections or diseases or disorders related to bacterial 
infections. The solution, to this technical problem is achieved by providing the 
embodiments characterized in the claims. 

Accordingly, the present invention relates to a method for identifying an antagonist 
or inhibitor of the expression of a gene encoding a polypeptide essential for 
bacterial growth wherein said gene is selected from the group consisting of ygbB, 
yfhC, yacE, ychB. yejD, yrfl, yggJ, yjeE. yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD. yceG and/or yjbC the sequence 



PCT/EPOO/03135 

WO 00/61793 

3 

of said genes being shown in Fig. 1, or a fragment or derivative or orthoiog 
thereof, said method comprising the steps of 

(a) testing a candidate antagonist or inhibitor or a sample comprising a plurality 
of said candidate antagonists or inhibitors for the inhibition or reduction of 
transcription of said gene or a fragment or derivative thereof; or 

(b) testing a candidate antagonist or inhibitor or a sample comprising a plurality 
of said candidate antagonists or inhibitors for the inhibition or reduction of 
translation of mRNA transcribed from said gene or a fragment or derivative 
thereof; and 

(c) identifying an antagonist or inhibitor or a sample comprising a plurality of 
said candidate antagonists or inhibitors that tests positive in step (a) and/or 
(b). 

The term "antagonist" or "inhibitor" as used herein means naturally occurring and 
synthetic compounds capable of counteracting or inhibiting an activity of a gene or 
gene product or interactions of the gene or gene product with other genes or gene 
products. Determining whether a compound is capable of . inhibiting or 
counteracting specific gene expression can be done, for example, by Northern 
blot analysis, Western blot analysis or proteome analysis. It can further be done 
by monitoring the phenotypic characteristics of a bacterial cell contacted with the 
compounds and compare it to that of a wild-type cell. In an additional 
embodiment, said characteristics may be compared to that of a cell contacted with 
a compound which is either known to be capable or incapable- of suppressing or 
activating the protein or gene, respectively, according to the invention. For 
example, the bacterial cell can be a transgenic cell and the phenotypic 
characteristics comprises a readout system. Further examples of determining 
whether a compound is capable of inhibiting or counteracting specific gene 
expression are described below. 

The term "expression" means the production of a protein or nucleotide sequence 
in a cell. However, said term also includes expression of the protein in a cell-free 
system. It includes transcription into an RNA product, and/or translation into a 
polypeptide from a DNA encoding that product. 

The term "transcription" as used herein means a DNA template dependent 
synthesis of a ribonucleic acid polymer encoding a polypeptide or a regulatory 
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sequence. The term "translation" as used herein means the polymerization of a 
polypeptide that is encoded by an RNA molecule by a protein complex. 
As used in accordance with the present invention, the term "fragment or 
derivative" denotes any variant the amino acid or nucleotide sequence of which 
deviates in its primary structure, e.g., in sequence composition or in length as well 
as to analogue components. For example, one or more amino acids of a 
polypeptide may be replaced in said fragment or derivative as long as the 
modified polypeptides remain functionally equivalent to their described 
counterparts. The .term "fragment or derivative" further denotes compounds 
analog to an antagonist or inhibitor that should have a stabilized electronic 
configuration and molecular conformation that allows key functional groups to be 
presented to the mentioned polypeptide in substantially the same way as the 
antagonist and inhibitor. The variant of the polypeptide may be a naturally 
occurring allelic variant of the polypeptide or non-naturally occurring variants of 
those polynucleotides. 

The term "orthologs" as used herein means homologous sequences in different 
species that evolved from a common ancestoral gene by speciation. Normally, 
orthologs retain the same function in the course of evolution. However, 
orthologous genes may or may not be responsible for a similar function (see, e.g., 
the glossary of the "Trends Guide to Bioinformatics", Trends Supplement 1998, 
Elsevier Science). Orthologous genes, nucleic acids or proteins comprise genes, 
nucleic acids or proteins which have one or more sequences or structural motifs in 
common. For example, the sequence motifs of proteins can comprise short, i.e. 
repetitive sequences or amino acid positions conserved in the primary structure 
and/or conserved in higher protein structures, e.g. secondary or tertiary structure. 
Orthologous nucleic acids or genes can comprise molecules having short 
stretches of one or more homologous (same or similar) sequences, for example 
protein binding boxes or structure forming boxes. Methods for the identification of 
a candidate ortholog of a gene or polypeptide described herein are known to 
those skilled in the art and are described for example in Sambrook et al. (1989), 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, 
New York, or Ausubel (1994), Current Protocols in Mol. Biol.. The person skilled in 
the art knows how to identify orthologous genes, nucleic acids or polypeptides by 
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computer supported analysis (e.g. BLAST) of known sequences and its 
interpretation. 

The terms "gene", "polynucleotide", "nucleic acid sequence", "nucleotide 
sequence", "DNA sequence" or "nucleic acid molecule" as used herein refer to 
polymeric forms of nucleotides of any length, either • ribonucleotides or 
deoxyribonucleotides and only to the primary structure of the molecule. Thus, 
these terms include double- and single-stranded DNA, and RNA. They also 
include known types of modifications, for example, methylation, "caps" 
substitution of one or more of the naturally occurring nucleotides with an analog. 
Preferably, the DNA sequence of the invention comprises a coding sequence 
encoding at least the mature form of the above defined protein, i.e. the protein 
which is posttranslationally processed in .its biologically active form, for example 
due to cleavage of leader or secretory sequences or a proprotein sequence or 
other natural proteolytic cleavage points. 

The term "plurality of candidate antagonists or inhibitors" is to be understood as a 
plurality of substances which may or may not be identical. 

Said antagonists or inhibitors or plurality of candidate antagonists or inhibitors 
may be chemically synthesized or microbiologically produced and/or comprised in, 
for example, samples, e.g., cell extracts from, e.g., plants, animals or 
microorganisms. Furthermore, said compound(s) may be known in the art but 
hitherto not known to be capable of suppressing or inhibiting said polypeptide. 
The reaction mixture may be a cell free extract or may comprise a cell or tissue 
culture. Suitable set ups for the method of the invention are known to the person 
skilled in the art and are, for example, generally described in Alberts et al., 
Molecular Biology of the Cell, third edition (1994), in particular Chapter 17. The 
plurality of compounds may be, e.g., added to the reaction mixture, culture 
medium, injected into the cell or sprayed onto the plant. 

By combining computational processing of genomic information, with microbial 
genetics, the inventors have been able to identify 24 E. coli essential genes and 
their respective orthologs (Fig. 3) that fulfill several criteria for being attractive 
antibacterial targets: hypothetical open reading frames, coding for essential 
functions (mutation is lethal for growth in rich media), broad conservation 
(orthologs are present in a wide range of bacteria including H. influenza, S. 
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pneumoniae. H. pylori, and B. burgdorferi) (Fig. 3) and low toxicity potential in 
higher organisms (mostly no orthologs are identified in the simple eukaryote S. 
cerevisiae). Thus, an antagonist or inhibitor of the expression of such an essential 
gene or of its function provides the key for an antibacterial therapy. The inventors 
assume that said antagonist or inhibitor stops or reduces bacterial growth and/or 
mediates bacterial death. 

Thus, the method of the present invention provides the options of development of 
new broad spectrum antibiotics against new pharmaceutical important targets. 
The findings of the present invention are particularly important in view of the. 
drawbacks of the present forms of treatment of bacterial infections, diseases and 
disorders related to bacterial infections. 

In line with the above, the present invention also relates to a method for testing a 
candidate antagonist or inhibitor of a polypeptide or mRNA essential for bacterial 
growth or survival encoded by a gene selected from the group consisting of ygbB, 
yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO. yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC or a fragment, 
derivative or ortholog thereof comprising the steps of 

(a) contacting a bacterial cell with candidate antagonist or inhibitor or a sample 
comprising a plurality of said candidate antagonists or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition and/or cell 
death. 

In a further embodiment, the present invention relates to a method for testing a 
candidate antagonist or inhibitor of the function of a gene essential for bacterial 
growth or survival wherein said gene is selected from the group consisting of 
ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, 
gcpE, kdtB, pfs, ycaJ. b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC or a 
fragment, derivative or ortholog thereof, comprising the steps of 
(a) contacting a bacterial cell comprising said gene with a candidate antagonist 

or inhibitor or a sample comprising a plurality of said candidate antagonists 

or inhibitors; and 
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(b) testing whether said contacting leads to cell growth inhibition and/or cell 
death. 

Bacteria, for which was shown that a gene as mentioned above expressed is 
essential, can be used in a proliferation assay to identify both ligands and 
potential antagonists or inhibitors to said polypeptide encoded by said essential 
gene. For example, E.'coli are grown in culture medium and incorporation of DNA 
precursors such as 3 H-thymidine or 5-bromo-2'-deoxyuridine (BrdU) is monitored 
as a parameter for DNA synthesis and cellular proliferation. Cells which have 
incorporated BrdU into DNA can be detected using a monoclonal antibody against 
BrdU and measured by an enzyme or fluorochrome-conjugated second antibody. 
The reaction is quantitated by fluorimetry or by spectrophotometry. The ability of 
the compound to be screened to inhibit proliferation may then be quantified. 
Further methods to determine growth and proliferation of bacteria are well known 
in the art, for example in Drews, Mikrobiol. Praktikum, Berlin, 1976. . 
Preferably, the antagonist or inhibitor binds to the gene product, i.e. the RNA or 
polypeptide, specifically encoded by said gene. 

For example, a candidate antagonist or inhibitor not known to be capable of 
binding to an polypeptide encoded by a essential gene as described above can be 
tested to bind thereto comprising contacting a bacterial cell comprising an isolated 
•molecule encoding said polypeptide with a candidate antagonist or inhibitor under 
conditions permitting binding of ligands known to bind thereto, detecting the 
presence of any bound ligand, and thereby determining whether such candidate 
antagonist or inhibitor inhibits the binding of a ligand to a polypeptide as described, 
above. 

Proteins that bind to a polypeptide as described above and might inhibit or 
counteract, to said polypeptide can be "captured" using the yeast two-hybrid 
system (Fields, Nature 340 (1989), 245-246). A modified version of the yeast two- 
hybrid system has been described by Roger Brent and his colleagues (Gyuris, 
Cell 75 (1993), 791-803; Zervos, Cell 72 (1993), 223-232). Briefly, a domain of the 
polypeptide is used as bait for binding compounds. Positives are then selected by 
their ability to grow on plates lacking leucine, and then further tested for. their 
ability to turn blue on plates with X-gal, as previously described in great detail 
(Gyuris, supra; WO 95/31544). Once amino acid sequences are identified which 
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bind to a polypeptide essential for bacterial growth or survival, these sequences 
can be screened for antagonist activity using, for example, the proliferation assay 
described above or used for screening for antagonists of said binding. 
Another assay which can be performed to identify inhibitors and antagonists 
involves the use of combinatorial chemistry to produce random peptides which 
then can be screened for both binding affinity and antagonist effects. One such 
assay has recently been performed using random peptides expressed on the 
surface of a bacteriophage (Wu (1996), Nature Biotechnology 14, 429-431). 

In a preferred embodiment of the method' of the present invention said method 
further comprises identifying an antagonist or inhibitor optionally from said sample 
of candidate antagonists or inhibitors. 

If a sample contains a candidate antagonist or inhibitor, or a plurality of candidate 
antagonists or inhibitors, as identified in the method of the invention, then it is 
either possible to isolate the candidate antagonists or inhibitors from the original 
sample identified as containing the compound capable of suppressing or inhibiting 
bacterial growth or survival, or one can further subdivide the original sample, for 
example, if it consists of a plurality of different candidate antagonists or inhibitors, 
so as to reduce the number of different substances per sample and repeat the 
method with the subdivisions of the original sample. Depending on the complexity 
of the samples, the steps described above can be performed several times, 
preferably until the sample identified according to the method of the invention only 
comprises a limited number of or only one substance(s). Preferably said sample 
comprises substances of similar chemical and/or physical properties, and most 
preferably said substances are identical. As regards the identification of candidate 
antagonists or inhibitors by any of the above-referenced embodiments of the 
invention, a variety of formats or tools is. available to the person skilled in the art. 
Thus, several -methods are known to the person skilled in the art for producing 
and screening large libraries to identify compounds having specific affinity for a 
target. These methods include the phage-display method in which randomized 
peptides are displayed from phage and screened by affinity chromatography to an 
immobilized receptor; see,' e.g., WO 91/17271, WO 92/01047, US-A-5.223.409. In 
another approach, combinatorial libraries of polymers immobilized on a chip are 
synthesized using, photolithography; see, e.g., US-A-5.143.854, WO 90/15070 
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and WO 92/10092. The immobilized -polymers are contacted with a labeled 
receptor and scanned for label to identify polymers binding to the receptor. The 
synthesis and screening of peptide libraries on continuous cellulose membrane 
supports that can be used for identifying binding ligands of the polypeptide of the 
invention and thus possible inhibitors and antagonists is described, for example, 
in Kramer. Methods Mol. Biol. 87 (1998), 25-39. This method can also be used, 
for example, for determining, the binding sites and the recognition motifs in the 
polypeptide as described above. In like manner, the substrate specificity of the 
DnaK chaperon was determined and the contact sites between human interieukin- 
6 and its receptor; see Rudiger, EMBO J. 16 (1997), 1501-1507 and Weiergraber. 
FEBS Lett. 379 (1996), 122-126, respectively. Furthermore, the above-mentioned 
methods'can be used for the construction of binding supertopes derived from the 
polypeptide of the invention. A similar approach was successfully described for 
peptide antigens of the anti-p24 (HIV-1) monoclonal antibody; see Kramer, Cell 91 
(1997). 799-809. A general route to fingerprint analyses of peptide-antibody 
interactions, using the clustered amino acid peptide library was described in 
Kramer, Mol. Immunol. 32 (1995), 459-465. In addition, antagonists or inhibitors of 
a polypeptide described above can be derived and identified from monoclonal 
. antibodies that specifically react with said polypeptide in accordance with the 
methods as described in Doring, Mol. Immunol. 31 (1994), ,1059-1067. 
More recently, WO 98/25146 described further methods for screening libraries of 
complexes for compounds having a desired property, especially, the capacity to 
agonize, bind to, or antagonize a polypeptide or its cellular receptor. The 
-complexes in such libraries comprise a compound under test, a tag recording at 
least one step in synthesis of the compound, and a tether susceptible to 
modification by a reporter molecule. Modification of the tether is used to signify 
that a complex contains a compound having a desired property. The tag can be 
decoded to reveal at least one step in the synthesis of such a compound. Other 
methods for identifying compounds which interact with the proteins according to the . 
invention or nucleic acid molecules encoding such molecules are, for example, the 
in vitro screening with the phage display system as well as filter binding assays or 
"real time" measuring of interaction using, for example, the BIAcore apparatus 
(Pharmacia). 
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All these methods can be used in accordance with the present invention to identify 
antagonists and inhibitors of the polypeptide of the invention. 

Additionally, the present invention relates in a preferred embodiment to a method 
comprising improving inhibitors or antagonists identified by peptidomimetics or by 
applying phage display or combinatorial library technique step(s). 
Peptidomimentics, phage display and combinatorial library techniques are well- 
known in the art and can be applied by the person skilled in the art without further 
ado to the improvement of the antagonist or inhibitor that is identified by the basic 
method referred to herein above. 

Methods for the generation and use of peptidomimetic combinatorial libraries are 
described in the prior art, for example in Ostresh, Methods In Enzymology 267 
(1996), 220-236; Dosner, Bioorg. Med. Chem. 4 (1996), 709-715; Beeley, Trends 
Biotechn. 12 (1994), 213-216; al-Obeidi, Mol. Biotechn. 9 (1998), 205-223; Wiley, 
Med. Res, Rev. 13 (1993), 327-384; Bohm, J.Comput. Aided Mol. Des. 10 
(1996), 265-272; and Hruby, Biopolymers 43 (1997), 219-266. 
Various sources for the basic structure of such an antagonist or inhibitor can be 
employed and comprise, for example, mimetic analogs of the polypeptide of the 
invention. Mimetic analogs of the polypeptide of the invention or biologically active 
fragments thereof can be generated by, for example, substituting the amino acids 
that are expected to be essential for the biological activity with, e.g., 
stereoisomers, i.e. D-amino acids; see e.g., Tsukida, J. Med. Chem. 40 (1997), 
3534-3541. furthermore, in case fragments are used for the design of biologically 
active analogs pro-mimetic components can be incorporated into a peptide to 
reestablish at least some of the conformational properties that may have been lost 
upon removal of part of the original polypeptide; see, e.g., Nachman, Regul. Pept. 
57 (1995), 359-370. Furthermore, the polypeptide can be used to identify 
synthetic chemical peptide mimetics that bind to or can function as a ligand, 
substrate, binding partner or the receptor of the polypeptide as effectively as does 
the natural polypeptide; see, e.g., Engleman, J. Clin. Invest. 99 (1997), 2284- 
2292. 

The structure-based design and synthesis of low-molecular-weight synthetic 
molecules that mimic the activity of the native biological polypeptide is further 
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described in, e.g.. Dowd, Nature Biotechnol. 16 (1998), 190-195; Kieber-Emmons, 
Current Opinion Biotechnol. 8 (1997), 435-441; Moore, Proc. West Pharmacol. 
Soc. 40 (1997), 115-119; Mathews, Proc. West Pharmacol. Soc. 40 (1997), 121- 
125; Mukhija, European J. Biochem. 254 (1998), 433-438. 

It is also well known to the person skilled in the art. that it is possible to design, 
synthesize and evaluate mimetics of small organic compounds that, for example, 
can act as a" substrate or ligand to a polypeptide as encoded by the essential 
gene as identified above. For example, it has been described that D-glucose 
mimetics of hapalosin exhibited similar efficiency as hapalosin in antagonizing 
multidrug resistance assistance-associated protein in cytotoxicity; see Dinh, J. 
Med.Chem. 41 (1998), 981-987. - 
the essential gene described above or the RNA encoded thereof, as has been 
described above, can also serve as a target for antagonists, or inhibitors. 
Antagonists may comprise, for example, proteins that bind to the mRNA of said 
gene, thereby destabilizing the native conformation of the mRNA and disturbing 
transcription and/or translation. Furthermore, methods are described in the 
literature for identifying nucleic acid molecules such as an RNA fragment that 
mimics the structure of a defined or undefined target RNA. molecule to which a 
compound binds inside of a cell resulting in retardation of cell growth or cell death; 
see, e.g., WO 98/18947 and references cited therein. These nucleic acid 
molecules can be used for identifying unknown compounds of pharmaceutical 
and/or agricultural interest, and for identifying unknown RNA targets for use in 
treating a disease. These methods and compositions can be used in screening for 
novel antibiotics, bacteriostatics, or modifications thereof or for identifying 
compounds useful to alter expression levels of proteins encoded by a nucleic acid 
molecule. Alternatively, for example, the conformational structure of the RNA 
fragment which mimics the binding site can be employed in rational drug design to 
modify known antibiotics to make them bind more avidly to the target. One such 
methodology is nuclear magnetic resonance (NMR), which is useful to identify 
drug and RNA conformational structures. Still other methods are, for example, the 
drug design methods as described in WO 95/35367, US-A-5,322.933, where the 
crystal structure of the RNA fragment can be deduced and computer programs 
are utilized to design novel binding compounds which can act as antibiotics. 
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The candidate antagonists and inhibitors which can be tested and identified 
according to a method of the invention may be taken from expression libraries, 
e.g., cDNA expression libraries, peptides, proteins, nucleic acids, antibodies, 
small organic compounds, hormones, peptidomimetics, PNAs or the like (Milner, 
Nature Medicine 1 (1995), 879-880; Hupp, Cell 83 (1995), 237-245; Gibbs, Cell 79 
(1994), 193-198 and references cited supra). Furthermore, genes encoding a 
putative regulator of an essential bacterial protein and/or which' exert their effects 
up- or downstream said protein may be identified using, for example, insertion 
mutagenesis using, for example, gene targeting vectors known in the art (see, 
e.g., Hayashi, Science 258 (1992), 1350-1353; Fritze and Walden, Gene 
activation by T-DNA tagging. In Methods in Molecular biology 44 (Gartland, 
K.M.A. and Davey, M.R., eds). Totowa: Human Press (1995), 281-294.) or 
transposon tagging (Chandlee. Physiologia Plantarum 78 (1990), 105-115). Said 
compounds can also be functional derivatives or analogues of known inhibitors or 
antagonists. Such useful compounds can be for example transacting factors 
which bind an above-described polypeptide. Identification of transacting factors 
can be carried out using standard methods in the art (see, e.g., Sambrook, supra v 
and Ausubel, supra). To determine whether a protein binds to the protein or 
regulatory sequence of the invention, standard native gel-shift analyses can be 
carried out. In order to identify a transacting factor which binds to the protein or 
regulatory sequence of the invention, the protein or regulatory sequence of the 
invention can be used as an affinity reagent in standard protein purification 
methods, or as a probe for screening an expression library. The identification of 
nucleic acid molecules which encode proteins which interact with the polypeptide 
described above can also be achieved, for example, as described in Scofield 
(Science 274 (1996), 2063-2065) by use of the so-called yeast "two-hybrid system"; 
see also the appended example. In this system, e.g., the protein encoded by the 
nucleic acid molecules identified in this invention or a smaller part thereof is linked to 
the DNA-binding domain of the GAL4 transcription factor. A yeast strain expressing 
this fusion gene, and comprising a lacZ reporter gene driven by an appropriate 
promoter, which is recognized by the GAL4 or LexA. transcription factor, is 
transformed with a library of. cDNAs which will express plant genes or fragments 
thereof fused to an activation domain. Thus, if a peptide encoded by one of the 
cDNAs is able to interact with the fusion peptide comprising a peptide of a protein of 
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the invention, the complex is able to direct expression of the reporter gene. In this 
way the nucleic acid molecules and the encoded peptide can be used to identify 
peptides and proteins interacting with the polypeptide described above. It is 
apparent to the person skilled in the art that this and similar systems may then 
further be exploited for the identification of inhibitors or antagonists of the 
polypeptide. 

Once the transacting factor is identified, modulation of its binding to or regulation 
of expression of the polypeptide described above can be pursued, beginning with, 
for example, screening for inhibitors against the binding of the transacting factor to 
the protein specified in accordance with the present invention. Inhibition of 
bacterial growth could then be achieved by applying the transacting factor (or its 
inhibitor). In addition, if the active form of the transacting factor is a dimer, 
dominant-negative mutants of the transacting factor could be made in order to 
inhibit its activity. 

Thus, the present invention also relates to the use of the polypeptide as defined 
above for the identification of antagonists or inhibitors of a polypeptide essential 
for bacterial growth or survival. 

In another embodiment, the present invention relates to a method for designing an 
improved antagonist or inhibitor for the treatment of a bacterial infection or 
disorder or disease related to a bacterial infection comprising the steps of 

(a) identification of the binding site of an antagonist or inhibitor to the 
polypeptide ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808 t yeaA.yagF, b1983, yidD, 
yceG and/or yjbC, the sequence of said genes being shown in Fig. 1, or 
identified according to the method of the present invention, by site-directed 
mutagenesis and chimeric polypeptide studies; 

(b) molecular modeling of both the binding site of said antagonist or inhibitor 
and the structure of said polypeptide; and 

(c) modification of said antagonist or inhibitor to improve its binding specificity 
or affinity for the polypeptide. 



Biological assays as described above or other assays such as assays based on 
crystallography or NMR may be employed to assess the specificity or potency of 
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the antagonist or inhibitor wherein the decrease of one or more activities of the 
polypeptide may be used" to monitor said specificity or potency. All techniques 
employed in the various steps of the method of the invention are conventional or 
can be derived by the person skilled in the art from conventional techniques 
without further ado. 

For example, identification of the binding site of said antagonist or inhibitor by site- 
directed mutagenesis and chimerical protein studies can be achieved by 
modifications in the (poly)peptide primary sequence that affect the antagonist's or 
inhibitor's affinity; this usually allows to precisely map the binding pocket for the 
drug. Identification of binding sites may be assisted by computer programs. Thus, 
appropriate computer programs can be used for the identification of interactive 
sites of a putative antagonist or inhibitor and the polypeptide of the invention by 
computer assisted searches, for complementary structural motifs (Fassiria, 
Immunomethods 5 (1994), 114-120). 

As regards step (b). the following protocols may be envisaged: Once the effector 
site for antagonists or inhibitors has been mapped, the precise residues 
interacting with different parts of the antagonists or inhibitors can be identified by 
combination of the information, obtained from mutagenesis studies (step (a)) and 
computer simulations of the structure of the binding site provided that the precise 
three-dimensional structure of the antagonists or inhibitors is known (if not, it can 
be predicted by computational simulation). If said antagonist or inhibitor is itself a 
peptide, it can be also mutated to determine which residues interact with others in 
the above-mentioned polypeptide essential for bacterial growth and survival. 
Finally, in step (c) the antagonist or inhibitor can be modified to improve its 
binding affinity or its potency and specificity. If, for instance, there are electrostatic 
interactions between a particular residue of an polypeptide as defined above and 
some region of an antagonist or inhibitor molecule, the overall charge in that 
region can be modified to increase that particular interaction. Furthermore, the 
three-dimensional and/or crystallographic structure of inhibitors or antagonists of 
the polypeptide of the invention can be used for the design of peptidomimetic 
inhibitors or antagonists, e.g. in combination with said polypeptide (Rose. 
Biochemistry 35 (1996), 12933-12944; Rutenber, Bioorg. Med. Chem. 4 (1996), 
- 1545-1558). 
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Potential antagonists/inhibitors include antisense molecules. Antisense 
technology can be used to control gene expression through antisense DNA or 
through triple-helix formation. Antisense techniques are discussed, for example, in 
Okano, J. Neurochem. 56 (1991), 560; Oligodeoxynucleotides as Antisense 
Inhibitors of Gene Expression. CRC Press. Boca Raton. FL (1988). Triple helix 
formation is discussed in, for instance. Lee, Nucl. Acids Res. 6 (1979). 3073; 
Cooney, Science 241 (1988). 456; and Dervan, Science 251 (1991). 1360. The 
methods are based on binding of a polynucleotide to a complementary DNA or 
RNA. 

For example, the 5' coding portion of a polynucleotide that encodes the mature 
polypeptide as described above may be used to design an antisense RNA ' 
oligonucleotide of from about 10 to 40 base pairs in length. A DNA oligonucleotide 
is designed to be complementary to a region of the gene involved in transcription 
thereby preventing transcription and the production of the protein. The antisense 
RNA oligonucleotide hybridizes to the mRNA and blocks translation of the mRNA 
molecule into receptor polypeptide. As indicated, antagonist or inhibitor e.g. 
polyclonal and monoclonal antibody 'according -to the teachings of the present 
invention can be raised according to the methods disclosed in Tartaglia, J. Biol. 
Chem. 267 (1992), 4304-4307; Tartaglia, Cell 73 (1993). 213-216. and PCT 
Application WO 94/091 37. 

Antibodies may be prepared by any of a variety of methods using immunogens of 
the polypeptide described above. As indicated, such immunogens include the full 
length polypeptide (which may or may not include the leader sequence) and 
fragments such as the ligand binding domain, the extracellular domain and the 
intracellular domain. These antibodies can be monoclonal antibodies, polyclonal 
antibodies or synthetic antibodies as well as fragments of antibodies, such as 
Fab*, Fv, F(ab') 2 , disulphide-bridged Fv or scFv fragments, etc. Monoclonal 
antibodies can be prepared, for example, by. the techniques as originally 
described in Kohler and Milstein, Nature 256 (1975). 495, and Galfre, Meth. 
Enzymol. 73 (1981), 3, which comprise the fusion of mouse myeloma cells to 
spleen cells derived from immunized mammals. Furthermore, antibodies or 
fragments thereof to the aforementioned peptides can be obtained by using 
methods which are described, e.g., in Harlow and Lane "Antibodies. A Laboratory 
Manual", CSH Press, Cold Spring Harbor. 1988. 
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The antagonists or inhibitors isolated by the above methods also serve as lead 
compounds for the development of analog compounds. The analogs should have 
a stabilized electronic configuration and molecular conformation that allows key 
functional groups to be presented to the receptor in substantially the same way as 
the lead compound. In particular, the analog compounds have spatial electronic 
properties which are comparable to the binding region, but can be smaller 
molecules than the lead compound,, frequently having a molecular weight below 
about 2 kD and preferably below about 1 kD. Identification of analog compounds 
can be performed through use of techniques such as self-consistent field (SCF) 
analysis, configuration interaction (CI) analysis, and normal mode dynamics 
analysis. Computer programs for implementing these techniques are available; 
e.g., Rein, Computer-Assisted Modeling of Receptor-Ligand Interactions (Alan 
Liss, New York, 1989). Methods for the preparation of chemical derivatives and 
analogues are well known to those skilled in the art and are described in, for 
example, Beilstein, Handbook of Organic Chemistry, Springer edition New York 
Inc., 175 Fifth Avenue, New York, N.Y. 10010 U.S.A. and Organic Synthesis, 
Wiley, New York, USA. Furthermore, said derivatives and analogues can be 
tested for their effects according to methods known in the art. Furthermore, 
peptidomimetics and/or computer aided design of appropriate derivatives and 
analogues can be used, for example, according to the methods described above. 

The inhibitor or antagonist identified by the above-described method may prove 
useful as a pesticide, and/or antibiotic. The inhibitors and antagonists of the 
present invention preferably have a specificity at least substantially identical to the 
binding specificity of the natural ligand or binding partner of the polypeptide 
described above. An antagonist or inhibitor can have a binding affinity to said 
polypeptide of at least 10 5 M" 1 ; preferably higher than 10 7 M' 1 and advantageously 
up to 10 10 M"\ 'In a preferred embodiment, an inhibitor, e.g. suppressive antibody, 
has an affinity of at least about 10* 7 M, preferably at least about 10" 9 M and most 
preferably at least about 10" 11 M; and the antagonist has an affinity of less than 
about 10' 7 M, preferably less than about 10* 9 M and most preferably in order of 10" 
11 M. 
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in the case of nucleic acid molecules it is preferred that they have a binding 
affinity to those encoding the amino acid sequences encoded in any one of SEQ 
lb NOS: 16 to 39 of at most 2-, 5- or 10-fold less than an exact complement of 20 
consecutive nucleotides of the above described nucleic acid molecules. 

In another embodiment, the present invention relates to a method for producing a 
therapeutic agent comprising synthesizing the above-described antagonist or 
inhibitor. 

Preferably, the compound identified according to the above described method or 
its analog or derivative is further formulated in a therapeutically active form or in a 
form suitable for the application against bacterial infections or diseases related to 
such an infection. For example, it can be combined with a pharmaceutical^ 
acceptable carrier known in the art. Thus, the present invention also relates to a 
method of producing a (therapeutically effective) composition comprising the 
steps of one of the above described methods of the invention and combining the 
compound obtained or identified in the method of the invention or an analog or 
derivative thereof with a pharmaceutically acceptable carrier. 

Also, the present invention relates to a composition comprising the antagonist or 
inhibitor mentioned above. As is evident from the above, the present invention 
generally relates to compositions comprising at least one of the aforementioned 
antagonists or inhibitors, which may be nucleic acid molecules, proteins or 
antibodies. Advantageously, said composition is for use as a medicament, a 
diagnostic means, or a kit. 

The term "composition", as used in accordance with the present invention, 
comprises at least one small molecule or molecule as identified herein above, 
such as a protein, an antigenic fragment of said protein, a fusion protein, a nucleic 
acid molecule and/or an antibody as described above and, optionally, further 
molecules, either alone or in combination, like e.g. molecules which are capable 
of optimizing antigen processing, cytokines, immunoglobulins, lymphokines or 
CpG-containing DNA stretches or, optionally, adjuvants. The composition may be 
in solid, liquid or gaseous form and may be', inter alia, in form of (a) powder(s), (a) 
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tablet(s), (a) solution(s) or (an) aerosol(s). In a preferred embodiment, said 
composition comprises at least two, preferably three, more preferably four, most 
preferably five differentially synthesized proteins. 

The antagonists and inhibitors of the invention appear to function against gene- 
products which are essential in several strains or genera of bacteria. Accordingly, 
the above-described antagonists and inhibitors may be used to inhibit the growth 
of a wide spectrum of bacteria. The above described antagonists or inhibitors may 
be used to slow, stop, or reverse bacterial growth. Thus, the present invention 
also relates to a method of producing a therapeutic agent comprising the steps of 
.the methods described hereinbefore and synthesizing the antagonist or inhibitor 
obtained or identified as described above or an analog or derivative thereof, 
preferably in an amount sufficient to provide said agent in a therapeutically 
effective amount to a patient. 

Compounds identified by the above methods or analogs are formulated for 
therapeutic use as pharmaceutical compositions. The compositions can also 
include, depending on the formulation desired, pharmaceutical^ acceptable,- 
usually sterile, non-toxic carriers or diluents, which are defined as vehicles 
commonly used to formulate pharmaceutical compositions for animal or human 
administration. The diluent is , selected so as not to affect the biological activity of 
the combination. Examples of such diluents are distilled water, physiological 
saline. Ringer's solutions, dextrose solution, and Hank's solution. In addition, the 
pharmaceutical composition or formulation may also include other carriers, 
adjuvants, or nontoxic, nontherapeutic, nonimmunogenic stabilizers and the like. 
A therapeutically effective dose refers to that amount of protein or its antibodies, 
antagonists, or inhibitors which ameliorate the symptoms or condition. 
Therapeutic efficacy and toxicity of such compounds can be determined by 
standard pharmaceutical procedures in cell cultures or experimental animals, e.g.. 
ED50 (the dose therapeutically effective in 50% of the population) and LD50 (the 
dose lethal to 50% of the population). Jhe dose ratio between therapeutic and 
toxic effects is the therapeutic index, and it can be expressed as the ratio, 
LD50/ED50. 

Compositions comprising such carriers can be formulated by well known 
conventional methods. These pharmaceutical compositions can be administered 
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to the subject at a suitable dose.. Administration of the suitable compositions may 
be effected by different ways. e.g.. by intravenous, intraperitoneal, subcutaneous, 
intramuscular, topical, intradermal, intranasal or intrabronchial administration. The 
dosage regimen will be determined by the attending physician and clinical factors. 
As is well known in'th'e medical arts, dosages for any one patient depends upon 
many factors, including the patient's size, body surface area, age. the particular 
compound to be administered, sex, time and route of administration, general 
health, and other drugs being administered concurrently. Proteinaceous 
pharmaceutical^ active matter may be present in amounts between 1 ng and 10 
mg'per dose; however, doses below or above this exemplary range are 
envisioned, especially considering the aforementioned factors. Administration of 
the suitable compositions may be effected by different ways, e.g., by intravenous, 
intraperitoneal, subcutaneous, intramuscular, topical or intradermal administration. 
If the regimen is a continuous infusion, it should also be in the range of 1 ug to 10 
mg units per kilogram of body weight per minute, respectively. Progress can be 
monitored by periodic assessment. The compositions of the invention may be 
administered locally or systemically. Administration will generally be parenterally. 
e.g., intravenously. The compositions of the invention may also be administered 
directly to the target site, e.g., by biolistic delivery to an internal or external target 
site or by catheter to a site in an artery. Preparations for parenteral administration 
include sterile aqueous or non-aqueous solutions, suspensions, and emulsions. 
Examples of non-aqueous solvents are propylene glycol, polyethylene glycol, 
vegetable oils such as olive oil, and injectable organic esters such as ethyl oleate. 
Aqueous carriers include water, alcoholic/aqueous solutions, emulsions or 
suspensions, including saline and buffered media. Parenteral vehicles include 
sodium chloride solution, Ringer's dextrose, dextrose and sodium chloride, 
lactated Ringer's, or fixed, oils. Intravenous vehicles include -fluid and nutrient 
replenishes, electrolyte replenishes (such as those based on Ringer's dextrose), 
and the like. Presetvatives and other additives may also be present such as. for 
example, antimicrobials, anti-oxidants, chelating agents, and inert gases and the 
like. Furthermore, the pharmaceutical composition of the invention may comprise 
further agents such as interleukins. interferons and/or CpG-containing ON A 
stretches, depending on the intended use of the pharmaceutical composition. 
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In another embodiment, the present invention relates to a kit comprising at least 
one of the aforementioned "antagonists or inhibitors of the invention. The kit of the 
• invention as well as the composition may in a preferred embodiment contain 
further ingredients such as selection markers, antibiotics, cytokines and 
components for simplifying or supporting the treatment of bacterial infections or- 
disorders or diseases related to bacterial infections. The. kit of the invention may 
advantageously.be used for carrying out the method of the invention and could 
be, inter alia, employed in a variety of applications referred to herein, e.g., in the 
diagnostic field or as research tool. The parts of the kit of the invention can be 
packaged individually in vials or in combination in containers or multiconiainer 
units. Manufacture of the kit follows preferably standard procedures which are 
known to the person skilled in the art. The kit or its ingredients according to the 
invention can be used in antibacterial therapies, for example, for any of the above 
described methods for detecting further inhibitors and antagonists essential for 
bacterial growth and survival. The kit of the invention and its ingredients are 
expected to be very useful for the healing and protection of animals and humans 
suffering from a bacterial- infection. 

The present invention also relates to a method for treating or preventing bacterial 
infections or diseases or disorders, related to bacterial infections comprising the 
step of administering to a subject in need thereof an antagonist or inhibitor 
identified herein above, optionally comprised in a pharmaceutical composition of 
the invention. 

In another embodiment the present invention relates to the use of a polypeptide 
encoded by the gene as identified above or a fragment, derivative or ortholog 
thereof or of any of said genes for the identification of an antagonist or inhibitor of 
said polypeptide fragment, derivate or ortholog or said gene. 

In a further embodiment the present invention relates to the use of said 
polypeptide, the therapeutic agent produced according to/ the invention, the 
antagonist or inhibitor obtained or identified by the method or use according to the 
invention for the preparation, of a pharmaceutical composition for the treatment of 
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(a) bacterial infection(s). disorder(s) and/or disease(s) related to bacterial 
infections. 

In another embodiment the present invention relates to a method for treating or 
preventing bacterial .infections or diseases or disorders related to bacterial- 
infections comprising the step of administering to a subject in need thereof an 
antagonist or inhibitor identified herein above, optionally comprised in the 
pharmaceutical composition according to the present invention. 

In a further embodiment the present invention relates to the use of the above- 
described polypeptide, a fragment, derivative or ortholog thereof or of any of said 
genes for screening for polypeptides interacting with said polypeptide using 
protein-protein interaction technologies, and/or for validating such interaction as 
being essential for bacterial survival and/or for screening for antagonists or 
inhibitors of such interaction. 

In a further embodiment, the present invention relates to the use of the above- 
described polypeptide, a fragment, derivative or ortholog thereof or of any of said 
genes for screening of polypeptide for polypeptide binding to said polypeptide, 
and/or for validating the peptides binding to said polypeptide as preventing growth 
of bacteria or being lethal to bacteria upon expression of said polypeptides in said 
bacteria, and/or for screening for small molecules competitively displacing said 
peptides. 

In another embodiment the present invention relates to the use of a conditional 
mutant of a gene as described above or a fragment, derivative or ortholog thereof 
or of surrogate ligands against said gene expressed in bacteria to induce a lethal 
phenotype in bacteria and/or for the analysis of said bacteria for surrogate 
markers by comparison of RNA or protein profiles in said bacteria with RNA or 
protein profiles in wild type bacteria, and/or the use of said surrogate markers for 
the identification of antagonists of the essential function of said gene. 
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in another embodiment the present invention relates to a method for identifying or 
isolating a surrogate marker comprising the steps as described in the above- 
recited method of the present invention. 

In a further embodiment the present invention relates to a method for identifying 
or isolating a surrogate marker comprising the steps of 

(a) inducing a lethal phenotype in bacteria representing a conditional mutant of 
a gene selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, 
yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, 
b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC; and 

(b) 1 analysing said bacteria comparing the RNA or protein profile of said 

bacteria with wild type bacteria. 

The invention also relates to the above recited genes and polypeptides and 
fragments, derivatives and orthologs thereof. 

These and other embodiments are disclosed and encompassed by the description 
and examples of the present invention. Further literature concerning any one of 
the methods, uses and compounds to be employed in accordance with the 
present invention may be retrieved from public libraries, using for example 
electronic devices. For example the public database "Medline" may be utilized 
which is available on the Internet, for example under 
http://www.ncbi.nlm.nih.gov/PubMed/medline.html. Further databases and 
addresses, such as http://www.ncbi.nlm.nih.gov/, 

http://www.infobiogen.fr/, http://www.fmi.ch/biology/research_tools.html, http://ww 
w.tigr.org/, are known to the person skilled in the art and can also be obtained 
using, e.g., http://www.lycos.com. An overview of patent information in 
biotechnology and a -survey of relevant sources of patent information useful for 
retrospective searching and for current awareness is given in Berks, TIBTECH 12 
(1994), 352-364. 

the present invention is further illustrated by reference to the following non- 
limiting, examples. 
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Unless stated otherwise in the examples, all recombinant DNA techniques are 
performed according to protocols as described in Sambrook et al. (1989), 
Molecular Cloning : A Laboratory Manual. Cold Spring Harbor Laboratory Press, 
NY or in Volumes 1 and 2 of Ausubel et al. (1994), Current Protocols in Molecular 
Biology, Current Protocols. Standard materials and' methods for plant molecular 
work are described in Plant Molecular Biology Labfase (1993) by R.D.D: Croy, 
jointly published by BIOS Scientific Publications Ltd. (UK) and Blackwell Scientific 
Publications (UK). 

Brief description of the figures 

Figure 1: Sequences of the essential bacterial genes identified according to 
the method described in the examples 

Figure 2: PCR strategy and the position of primers used 

Figure 3: Sequence comparison table of essential E.coli genes with proposed 
orthologs from various bacteria. Unfinished genomes are indicated 
by. asterisk. Complete genomes were analysed using BlastP2. 
Unfinished . genomes were analysed with TBIastN. Orthologous 
sequences can be accessed at the respective WWW links as 
indicated in the footnotes. 

Figure 4: Multiple Sequence Alignment (MSA) .of E. coli gene ygbB with 
orthologs in 5 different bacterial organisms including homology 
score. Similar MSA with similar results have been created for all 22 
essential bacterial genes. 

Example 1 



An k automated BLASTP-based genome comparisons to identify £ coli FUN genes 
resulted in the following list of 65 candidate genes which are conserved between 
£. coli, S. subtilis, H influenzae, H. pylori, M. tuberculosis, Ch. trachomatis, B. 
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burgdorferi, T. pallidum. S. pneumoniae, S. aureus. E. faecalis, P. aeruginosa, B. 
pertussis and which were further analysed: 
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a : double mutants were created when the respective genes were paralogues in 
E.coli 



Creating in-frame deletions of E. coli genes 

The subsequent description of the construction of deletion mutants was 
carried out essentially equal for these 77 candidate genes. Particular details will 
exemplarily be described for one gene which gave rise to be essential (yfhC) and 
one which was non-essential (yggV). 
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1) Principle of the PCR-procedure and primer-design for in frame deletions: 

Unless an overlapping ORF exists, primers dgenX2 and dgenX3 are 
designed to delete the entire ORF from ATG to STOP, e.g.: 
ATGgttataaatttggagtgtgaaggttattgcgtgTAA (SEQ ID NO: 1) (see figure). The 5'- 
ends of primers dgenXI and dgenX4 contain random nucleotides followed 
preferably by a BamHI site (dgenXI) or a' Sail site (dgenX4) for cloning into 
plasmid pK03 (Link et al (1997), J Bac 179: 6228-6237). In most mutants, primers 
dgenX2 and dgenX3 contain a 33 bp tag sequence called "Church-tag". 

Church-tag forward direction: 5'-gttataaatttggagtgtgaaggttattgcgtg-3' (SEQ ID 
NO: 2) 

Church-tag reverse direction: 5'-cacgcaataaccttcacactccaaatttataac-3' (SEQ ID NO: 
3) 

This tag is used for a subsequent PCR in which the 5'- and 3'- flanking DNA- 
fragments of the deletion construct are assembled. 

In the few constructs lacking the "Church-tag", the primers dgenX2 and dgenX3 
carry at their 5'-ends 5 random nucleotides followed by a restriction site 
(preferably EcoRI) which by its positioning creates the in frame deletion. 

Oligos cgenXI and cgenX2 are used for the verification of the chromosomal 
situation (wild type or deletion) after the replacement procedure (Fig. 2). 

Primers for the respective candidate genes were designed as follows: 
dyfhCI : 5'-GATCGGATCCAAATTCCAGTTAGCCATGATGCGGTC-3' 
(SEQ ID. NO: 4) 

dyfhC2: 5'-CACGCAATMCCTTCACACTCCAAATTTATAACCATTATA 

CACGGACGCTATGC-3' (SEQ ID NO: 5) 
dyfhC3: 5'- 

GTTATAAATTTGGAGTGTGAAGGTTATTGCGTGACGGATTAATT 

TTGrrTCTCTT-S" (SEQ ID NO: 6) 
dyfhC4: 5'-GATCGTCGACGCGCTCGATATCACCGATGAACAACCG-3' 

(SEQ ID NO: 7) 

cyfhCI : 5'-CAATCCGCTGCTTTATTTCTGTCAG-3' (SEQ ID NO: 8) 
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cyfhC2: 5 '-TTAT AAC G AAATC AAC G G G AAAC CT- 3' (SEQ ID NO: 9) 

dyggVI: ' 5'-GATCGGATCCCTCTAAAAAATAAGGAATTAAAGG-3' 
(SEQ ID NO: 10) 

dyggV2: 5'-CACGCAATAACCTTCACACTCCAAATTTATAACCATAGGATAC 

CTAATTAATTAAC-3' (SEQ ID NO: 11) 
dyggV3: 5'-GTTATAAATTTGGAGTGTGAAGGTTATTGCGTGAAGAGCGCC 

ATTTCCCACCGT-3' (SEQ ID NO: 12) 
dyggV4: 5'-GATCGTCGACTCATATTGCTGATAACCCGCTGCGGT-3' 

(SEQ ID NO: 13) 

cyggVI : S'-GTTGACGGCCAGGCCAACAGTCAT-S' (SEQ ID NO: 1 4) 
cyggV2: ' 5 , -ATAACCCTGGGCAAtCGCCTCG-3■ (SEQ ID NO: 15). . 



Example 2 

Construction of the DN A-fragmerits comprising the deletion 

The 5'- and the 3'-flarrking DNA fragments are PCR amplified in a total 
volume of 50 ul as follows: - 



Chromosomal DNA from £ coli strain MG1655 (100 ng/ul): 

final cone: 1. ng/ul 
final cone: 1x 
final cone: 500 nM 
final cone: 500 nM 
: final cone: 5 U/100 pi 
final cone,: 250 pM 
to adjust volume to 50 ul 



10'Pwo-buffer 
dgenX1/3(10jiM) 
dgenX2 (4) (10 nM) 
Pwo-Polymerase 
dNTPs (25 mM) 
H z O 



PCR conditions: 

4' 94 °C 
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30 cycles: 30" 94 °C, 30" 44 °C, V 72 °C ' 
5'72°C 

The PCR products are' then purified with the High Pure PCR Purification Kit 
(Boehringer) to remove salts and enzyme (elute in 50 ul H 2 0). Alternatively, if 
PCR products contain prominent impurities, the respective fragment must be 
purified by agarose gel extraction (Gene Clean, Dianova) before the fragment 
assembly. 

Assembly PCR 

Equal amounts of 5'-. and 3'-fragment are applied as template DNA. in general a 
volume applied for gel electrophoresis giving an intense band .is. o.k. The total 
reaction volume is 100 p!. For the assembly the "outer" primers dgenXI and 
dgenX4 were used. 



S'-Fragment 
3'-Fragment 
10*Pwo-buffer 
dgenXI (10 uM) 
dgenX4(10uM) 
Pwo-Pol (Boehringer) 
dNTPs (25mM) . 
H 2 0 



. approx. 1 0 ng ' 
approx. 10 ng 
final cone: 1x 

final cone: 500 nM (50 pmol/100 pi) 
final cone: 500 nM (50 pmol/100 pi) 
final cone: 5 U/1 00.pl 
final cone: 250 pM 
add to 100 (.U 



PCR conditions: 

4' 94 °C 

10 cycles: 30" 94 °C, 30" 44 °C, V 72 °C 
25 cycles: 30" 94 °C, 30" 44 °C, 3' 72 °C 
5' 72 °C 



The success of the PCR is checked by agarose gel electrophoresis. The 
assembled PCR product is purified with the High Pure PCR Purification Kit and 
the complete eluate of 50 ul is over-night digested with BamHI and Sail in a 
volume of 60 ul. After gel electrophoresis the digested product is purified with 
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3) Plasmid-DNA from 4 ml over-night culture is prepared using a QIAgen 
Miniprep Kit and a double restriction . analysis with BamHI/Sall and 
EcoRI/HindllMs performed to verify the clones. 

Protocol referring to the construction of assembly products by a 
restriction site: 

The 5'- and the 3'-fragments are PCR amplified as described above. The PCR 
products are' purified with the High Pure PCR Purification Kit (Boehringer) to 
remove- salts and enzyme and 5 to 10 jil are digested over night using the 
restriction site creating the deletion (primers 2 and 3; mostly EcoRI) in a total 
volume of 30 pi. The restriction products are again purified with the High Pure 
PCR Purification Kit to remove nucleotides, salts and enzyme. (Alternatively: 
Following preparative agarose gel electrophoresis the cut fragments are isolated 
using Gene Clean (Dianova) and eluted in a volume of 25 The cut fragments 
(3-6 jjl each) are ligated in a volume of 10-15 ^1 using T4-DNA ligase for 2 hours 
at room temperature. 5 ^1 of this ligation mix is directly used as a template for a 
second PCR. In this PCR, the assembled fragments are amplified using primers 
dgenXI and dgenX4. The reaction is set up as described above with two 
exceptions: 1) The total reaction volume is 100 pi and 2) the extension step at 72 
°C lasts 3\ 

Example 3 

The chromosomal exchange strategy 

(Link et al (1997), J Bac 179: 6228-6237) 

Cointegration: 

Cointegration = integration of a plasmid into the chromosome by a recombination 
event 

The pK03 derivative is transformed into MG1655 or any recA+ strain 
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Day 1 

The strain is grown at 30 °C in LB containing 20 ug/ml chloramphenicol (LB- 
Cam20) to an OD 6 oo of -1.0: Afterwards, perform 10-fold serial dilutions in the 
same medium (down to 10' 7 ). For the following plating use prewarmed LB-Cam20 
agar plates. Plate 100 of dilutions 10"* and 10' 5 for incubation at 44 °C and 100 
\x\ of dilutions 1 0' 6 and 1 0" 7 for incubation at 30 °C. 

Day 2 

Following incubation at the respective temperature, determine -the factor 
c.f.u.44 °C/c.f.u.30 °C (c.f.u/= colony forming units). This factor for pK03 without 
insert is in the range 1*10' 4 to 5M0* 4 and should be significantly larger in the case 
of successful cointegratiori'. Purify 8 randomly chosen clones from the 44 °C plate 
twice on LB-Cam20 agar plates at 44 °C (during Day 2 and over night to Day 3). 
Optionally, confirm the clones for their identity as cointegrates by colony-PCR. 

Resolution and counter-selection: > 

Resolution = resolution of the cointegrate resulting in a self replicative plasmid by 
a second recombination event 

Counter-selection = selection against the presence of plasmid in the cell 
Day 3 

Pool single colonies from each of the 8 cointegrates in 100 jxl LB and use this 
suspension as an inoculum for 10 ml.LB+5 %sucrose. After growth at 30 °C (8 to 
10 hours during a day is sufficient) 10-fold serial dilutions are performed .and 100 
jil of dilutions 10* 4 , 10" 5 , and 10" 6 are plated onto LB agar+5 % sucrose and grown 
over night at 30 °C. 

Day 4 

50 single colonies are replica streaked on LB+Cam20 and LB+5 % sucrose to test 
for the loss of plasmid. 
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Example 4 

Testing for essentiality of FUN genes of £. coli and interpretation of the 

results 

Day 5 

The clones sensitive to chloramphenicol are then tested for their genotype (wild 
type versus in-frame deletion) by colony-PCR using primers cgenXI and cgenX2 
(10-48 clones). 

•In the case of the gene yfhC out of 48 clones tested only wild type situation on the 
chromosome could be detected. 

In the case of the gene yggV out of 48 clones 16 (= 33 %) revealed a PCR 
product with a size indicative for the deletion situation on the chromosome. 

Are 48 clones revealing no mutant enough to claim a gene . as essential? This 
question can be answered by asking for the number of clones that have to be 
tested to get a confidence of e.g. 99 % that really no mutants are present in an 
infinite, number of clones. Provided a hypothesis H 0 means that only the wild type 
genotype is viable and hypothesis Hi means that a fraction (1-x) of mutants is 
allowed to occur together with the wild type (x) among a population of clones (x + 
(1-x)), then the probability to make the wrong decision (decision for H 0 whereas 
is true) can be calculated as n 

(1) " x n /(1+x n ) 

where x is the fraction of wild type clones and n is the number of clones tested. 
The confidence niveau a to make the wrong decision (error probability) is given by 

(2) a>x n /(1+x n ) 
thereby resulting in 

(3) n>ln(a/(1-a))/ln(x) ' 

for the number of clones that have to be tested to prove or disprove hypothesis 
H 0 . 
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If the average probability for obtaining wild type clones (x) in a replacement 
experiment is 70 % (experimentally determined for 43 non-essential genes out of 
65 candidate genes), then, after testing of 26 clones which reveal a wild type 
genotype an uncertainty of 0.01 % error probability (a) remains that the claiming 
of a gene as essential could be wrong. Even if the rate of obtaining wild types (x) 
is set to 85 % (a value which occurs with a frequency of 10 % for replacement 
experiments with non-essential genes), then, by testing 32 clones (which was 
performed in every experiment giving rise to an essential gene) an error 
probability of only 0.6 % remains to chose the wrong hypothesis. 

Examle 5 

List of essential FUN genes obtained 

By the described method the following 24 genes were obtained which gave no 
deletion genotype and are therefore claimed to be essential: 



E. coli 


gene name GenBank# 


ygbB 


g1789103 


yfhC 


g17889H 


yacE 


g1 786292 


ychB 


g1 787459 


yejD 


g1788510 


yrfl 


g1789804 


yggJ 


g1789315 


yjeE 


g1790610 


yiaO 


g 1790004 


yrdC 


g2367210 


yhbC 


g1789561. 


ygbP 


g1789104 


ybeY 


g1 786880 
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g1 788863 


kdtB 


a1790065 


rife 


n17Rfi^4 


\/r*a 1 


□17871 19 


hlftOR 
U I ouo 


a1788110 


ye a A 




yagF 


g 1786464 


b1983 


g 1788294 


yidD , 


g140861 


yceG 


g 1787339 


yjbC 


g396357 
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CLAIMS 

1. A method for identifying an antagonist or inhibitor of the expression of a 
gene encoding a polypeptide essential for bacterial growth or survival 
wherein said gene is selected from the group consisting of ygbB t yfhC, 
yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, ybeY, gcpE, 
kdtB, pfs, ycaJ, b1808, yeaA, yagF, b 1-983, yidD, yceG and/or yjbC, the 
sequence of said genes being shown in Fig. 1 , or a fragment, derivative or 

\ ortholog thereof, said method comprising the steps of 

(a) testing a candidate antagonist or inhibitor or a sample comprising 
a plurality of said candidate antagonists or inhibitors for the 
inhibition or reduction of transcription of said gene or a fragment 
or derivative thereof; or 

(b) testing a candidate antagonist or inhibitor or a sample comprising 
. a plurality of said candidate antagonists or inhibitors for the 

inhibition or reduction of translation of mRNA transcribed from 
said gene or a fragment or derivative thereof; and 
. (c) identifying an antagonist or inhibitor or a sample comprising a 
plurality of said candidate antagonists or inhibitors that tests 
positive in step (a) and/or (b). 

2. A method for testing a candidate antagonist or inhibitor of a polypeptide or 
a mRNA essential for bacterial growth or survival encoded by a gene 
selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, 
yggJ, yjeE, yiaO, yrdC.yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, 
yeaA, yagF, b1983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or a fragment, derivative or ortholog thereof 
comprising the steps of 

(a) contacting a bacterial cell with a candidate antagonist or inhibitor 
or a sample comprising a plurality of said' candidate antagonists 
or inhibitors; and 

(b) testing whether said contacting leads to cell growth inhibition 
and/or cell death. 
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3. . A method for testing a candidate antagonist or inhibitor of the function of a 

gene essential for bacterial growth or survival wherein said gene is 
selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, 
yggJ, yjeE, yiaO, yrdC, yhbC t ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808 t 
yeaA, yagF, b1983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or a fragment, derivative or ortholog thereof, 
comprising the steps of 

(a) contacting a bacterial cell comprising said gene with a candidate 
antagonist or inhibitor or a sample comprising a plurality of said 
candidate antagonists or inhibitors; and 

(b) testing whether said contacting leads. to .cell -growth inhibition 
and/or cell death. 

4. The method of any one of claims 1 to 3 further comprising identifying an 
antagonist or inhibitor, optionally from said sample of candidate 
antagonists or inhibitors. 

5. The method of any one of claims 1 to 4 wherein said inhibitor or 
antagonist is further improved by peptidomimetics or by applying phage 
display or combinatorial library technique step(s). 

6. A method for designing an improved antagonist or inhibitor for the 
treatment of a bacterial infection or disorder or disease related to, a 
bacterial infection comprising the steps 

(a) identification of the binding site of an antagonist or inhibitor to the 
polypeptide ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, 
yrd.C. yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, 
yagF, b1 983, yidD, yceG and/or yjbC, the sequence of said genes 
being shown in Fig. 1, or obtained by or identified by the method 
of any one of claims 1 to 5 by site-directed mutagenesis and 
chimeric polypeptide studies; 

(b) molecular modeling of both the binding site of said antagonist or 
inhibitor and the structure of said polypeptide; and 
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(c) modification of said antagonist or inhibitor to improve its binding 
specificity or affinity for the polypeptide. 

7. An antagonist or inhibitor of the activity of a polypeptide encoded by a 
gene selected from the group consisting of ygbB, yfhC, yacE, ychB, yejD, 
yrfl, yggJ, yjeE, yiaO, yrdG, yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, 
b1808, yeaA, yagF, b1983, yidD, yceG and/or yjbC, the sequence of said 
genes being shown in Fig. 1 , or fragment, derivative or ortholog thereof or 
of the expression of a gene encoding said polypeptide or said fragment, 
derivative or ortholog or obtained by or identified by the method of any 
one of claims 1 to 6., . 

8. " A method for producing a therapeutic agent comprising synthesizing the 

antagonist or inhibitor identified, tested or designed according to the 
method of any one of claims 1 to 6 or the antagonist or inhibitor of claim 7 
or an analog or derivative thereof. 

9. A method for producing a composition comprising the steps of the method 
of any one of claims '1 to. 6 or synthesizing the antagonist or inhibitor of 
claim 7 and formulating said ^inhibitor or antagonist in a pharmaceutical^ 
acceptable form. 

10. A -composition comprising an antagonist or inhibitor of claim 7, the 
therapeutic agent produced by the method of claim 8 or the antagonist or 
inhibitor obtained by or identified in the method of any one of claims 1 to 6 
or produced according to claim 9 and optionally a pharmaceutical^ 
acceptable carrier. 

1 1 . the composition of claim 1 0 which is a pharmaceutical composition. 

12. The composition of claim 10 which is a kit. 

13. The composition of any one of claims 10 to 12 further' comprising an 
antibiotic and/or cytokine. 
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14. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1 , or a fragment, derivative or ortholog thereof or of any of said genes for 
the identification of an antagonist or inhibitor of the activity of said 
polypeptide or said fragment, derivative or ortholog or of the expression of 
a gene encoding said polypeptide or said fragment, derivative or ortholog. 

15. Use of an antagonist or inhibitor of claim 7, the therapeutic agent 
produced by the method of claim 8 or the antagonist or inhibitor obtained 
by or identified in the method of any one of claims 1 to 6 or produced 
according to claim 9 or identified by the use of any of the claims for the 
preparation of a pharmaceutical composition for the treatment of (a) 
bacterial infection(s), disorder(s) and/or disease(s) related to bacterial 
infections. 

16. A method for treating or preventing bacterial infections or diseases or 
disorders related to bacterial infections comprising the step of 

- administering to a subject in need thereof the antagonist or inhibitor 
obtained by or identified in the method of any one of claims 1 to 6 or. 
produced according to claim 9 optionally comprised in the pharmaceutical 
composition according to claim 1 1 . 

17. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE, ychB t yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, M983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1, or a fragment, derivative or ortholog thereof or any of said genes for 
screening for polypeptides interacting with said polypeptide using protein- 
protein interaction technologies, and/or 'for validating such interaction as 
being essential for bacterial survival and/or for screening for antagonists 
or inhibitors of such interaction. 
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18. Use of a polypeptide encoded by a gene selected from the group 
consisting of ygbB, yfhC, yacE,.ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, 
yhbC, ygbP, ybeY, gcpE t kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, 
yidD, yceG and/or yjbC, the sequence of said genes being shown in Fig. 
1, or a fragment, derivative or ortholog thereof or any of said genes for 
screening of polypeptide for polypeptide binding to said polypeptide, 
and/or for validating the peptides binding to said polypeptide as 
preventing growth of bacteria or being lethal to bacteria upon expression 
of said polypeptides in . said, bacteria, and/or for screening for small 
molecules competitively displacing said peptides. 

19. Use of conditional mutants in a gene selected from the group consisting of 
ygbB, yfhC, yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO, yrdC, yhbC, ygbP, 
ybeY, gcpE, kdtB, pfs, ycaJ, b1808, yeaA, yagF, b1983, yidD, yceG 
and/or yjbC, the sequence of said genes being shown in Fig. 1, or a 
fragment, derivative or ortholog thereof or of surrogate ligands against 
said gene expressed in bacteria to induce a lethal phenotype in bacteria 
and/or for the analysis of said bacteria for surrogate markers by 
comparisonof RNA or protein profiles in said bacteria with RNA or protein 
profiles in wild type bacteria, and/or the use of said surrogate markers for 
the identification of antagonists of the essential function of said gene. 

20. A method for identifying or isolating a surrogate marker comprising the 
steps as described in claim 19. 

21. A method for identifying or isolating a surrogate marker comprising the 
steps of - 

(a) inducing a lethal phenotype in bacteria containing a conditional 
mutant of a gene selected from the group consisting of ygbB, yfhC, 
yacE, ychB, yejD, yrfl, yggJ, yjeE, yiaO; yrdC, yhbC, ygbP, ybeY, 
gcpE, kdtB, pfs, ycaJ, b1808 v yeaA, yagF, b1983, yidD, yceG and/or 
yjbC, the sequence of said genes being shown in Fig. 1 ; and 
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(b) analysing said bacteria comparing the RNA or protein profile of said 
bacteria with wild type bacteria. 
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1/15 
. Fig. 1 

1) ygbB 

atgcgaattggacacggttttgacgtacatgcctttggcggtgaaggcccaattatcattggtggcgtacgcattccttacgaaaa 

aggattgctggcgcattctgatggcgacgtggcgctccatgcgttgaccgatgcattgcttggcgcggcggcgctgggggatat 

cggcaagctgttcccggataccgatceggcatttaaaggtgccgatagccgcgagctgctacgcgaagcctggcgtcgtattc 

aggcgaagggttatacccttggcaacgtcgatgtcactatcatcgctcaggcaccgaagatgttgccgcacattccacaaatg 

cgcgtgtttattgccgaagatctcggctgccatatggatgatgttaacgtgaaagccactactacggaaaaactgggatttaccg 

gacgtggggaagggattgcctgtgaagcggtggcgctactcattaaggcaacaaaatga (SEQ ID NO: 16) 

2) yfhC 

atgcgccgcgcttttataaccggagttttctttttgtctgaagtcgaatttagccacgaatactggatgcgtcacgcgctgacgctg 

gcgaaacgtgcctgggatgagcgggaagtgccggtcggcgcggtattagtgcataacaatcgggtaatcggcgaaggctg 

gaaccgcccgattggtcgccatgatcccaccgcacatgcagaaatcatggccctgcggcagggtggtctggtgatgcaaaat 

tatcgtctgatcgacgccacgttgtatgtcacgcttgaaccatgtgtaatgtgtgccggagcgatgatccacagtcgcattggtcg 

cgtggtctttggtgcgcgtgacgcgaaaactggcgctgcgggatctttaatggatgtgctgcatcatccgggtatgaatcaccga 

gtggaaattacggaaggaatactggcggatgagtgcgcggcgttgctcagtgacttctttcgcatgcgccgccaggaaattaa 

agcgcagaaaaaagcgcaatcctcgacggattaa (SEQ ID NO: 17) 

3) yacE 

atgaggtatatagttgccttaacgggiaggcattggcagtggcaagagtaccgttgccaatgcgtttgctgatctcggaattaabg 
tcattgatgccgatattattgcgcgtcaggtggttgaaccaggtg^ 

tgattgctgctgatggaacattgcagcgccgggccttgcgcgagcggatcttcgccaacccggaagagaaaaactggcttaa 

cgccctgctgcatccgctgattcagc^agagacgcaacaccagatccagcaagctacttccccctatgtactgtgggttgtgc^ 

attgctggtagaaaactcactgtataaaaaagcgaatcgagtgcttgtggtggatgtcagcccagaaacgcaacttaagcgc 

accatgcagcgcgatgatgtaactcgcgagcatgtcgaacaaatccttgctgctcaggcaacgcgcgaagcccgccttgccg 

tggcagatgacgtcattgataataacggcgcaccggatgctatc^ 

ttgcgtcgcagtttgtctcacaggaaaaaccgtaa (SEQ ID NO: 18) 

4) ychB 

atgcggacacagtggccctctccggcaaaacttaatctgtttttatacattaccggtcagcgtgcggatggttaccacacgctgc 

aaacgctgtttcagtttcttgattacggcgacaccatcagcattgagcttcgtgacgatggggatattcgtctgttaacgcccgttg 

aaggcgtggaacatgaagataacctgatcgttcgcgcagcgcgattgttgatgaaaactgcggcagacagcgggcgtcttcc 

gacgggaagcggtgcgaatatcagcattgacaagcgtttgccgatgggcggcggtctcggcggtggttcatccaatgccgcg 

acggtcctggtggcattaaatcatctctggcaatgcgggctaagcatggatgagctggcggaaatggggctgacgctgggcg 

cagatgttcctgtctttgttcgggggcatgccgcgtttgccgaaggcgttggtgaaatactaacgccggtggatccgccagaga 

agtggtatctggtggcgcaccctggtgtaagtattccgactccggtgatttttaaagatcctgaactcccgcgcaatacgc 

aaggtcaatagaaacgttgctaaaatgtgaattcagcaatgattgcgaggttatcg.caagaaaacgttttcgcgaggttgatgc 

ggtgctttcctggctgttagaatacgccccgtcgcgcctgactgggacaggggcctgtgtctttgctgaatttgatacagagtctg 

aagcccgccaggtgctagagcaagccccggaatggctcaatggctttgtggcgaaaggcgctaatctttccccattgcacag 

agccatgctttaa (SEQ ID NO: 19) 

5) yejD 

atgcgacttgataaatttatcgcacagcaactcggcgttagccgtgctattgccgggcgtgaaatccgcggcaatcgtgtcacc 

gtcgatggcgaaatcgtccgtaatgcagcgttcaaactgcttcctgaacatgatgtcgcttacgatggcaacccgctggcgcag 

caacacggtccacgttactt'catgctcaataagcctcagggctatgtttgctccacggacgaccctgatcacccaacggtg 

attttcttgatgaaccggtagcgtggaaactgcatgcggcggggcggttggatattgataccaccggtctggtgctgatgactga 

tgatggtcagtggtcgcaccgcattacttctccgcgccatcattgcgagaagacctatctggtgacactggaatcacctgtagct 

gacgatacggcagagcaatttgctaaaggGgtgcagctgcataacgaaaaagatctcactaagcctgcggtgctggaagtg 

attaccccaacgcaggttcgtctgaccatcagcgaagggcgttatcatcaggtgaaacgcatgttcgccgccgtgggtaacca 

cgtggttgagctgcatcgtgaacgtattggcggtattacgctggatgrt^ 

aagaaattgccagcgtcgtctaa (SEQ ID NO: 20) 
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6) yrfl 

atgattatgccgcaacatgaccaattacatcgctatctgtttgaaaactttgccgtgcgcggcgaactggtaaccgtttcggaaa 

ccctgcaacagatccttgagaaccacgattatccgcagcccgttaaaaacgtgctggcagaactgctggttgcgaccagcct 

gttaaccgctacgctgaagtttgatggtgatatcaccgtacagctgcagggcgacggtccgatgaatctggcggttattaacggt 

aacaataaccagcagatgcgcggtgtggcgcgcgtgcagggcgaaattccagaaaatgccgacctgaaaacgctggtcg 

gcaatggttacgtggtgatcaccattaccccgagcgaaggcgaacgctatcagggcgtagttggtctggaaggtgataccctg 

gcggcctgcctggaagattactttatgcgttctga'acagctgccgacgcgcctgtttattcgcaccggcgacgtagacggcaaa 

ccggctgcaggcggtatgttgttgcaggtaatgcctgcgcaaaatgcccagcaggacgactttgaccacctggcgacgctaa 

ccgaaaccatcaaaaccgaagaactgctgaccttaccggcaaacgaagtgttgtggcgtttgtatcacgaagaagaggtga 

cggtttacgatccgcaggatgtggagttcaaatgcacctgctcgcgtgaacgttgcgccgatgcgctgaaaacgctgcctgatg 

aagaagttgatagcatcctggcggaagatggcgaaattgacatgcattgtgattactgcggtaaccactatctgttcaatgcgat 

ggatattgctgaaatccgcaacaacgcgtctccggcagatccgcaagttcattaa (SEQ ID NO: 21) 

7) yggJ 

gtggggagacgacgcggatttttaactatgcgtatcccccgcatttatcatcctgaaccactgaccagccattctcacatcgcgct 

ttgcgaagatgccgccaaccatatcgggcgcgtactgcgcatggggccggggcaggcgttgcaattgtttgacggtagcaac 

caggtctttgacgccgaaattaccagcgccagcaaaaaaagcgtggaagtgaaggtgctggaaggccagatcgacgatcg 

cgaatctccgctgcatattcacctcggtcaggtgatgtcgcgtggtgaaaaaatggaatttactatccagaaatcgatcgaactc 

ggtgtaagcctcattacgccacttttttctgagcgctgcggcgttaaactggatagtgaacgtctgaacaagaagcttcagcagt 

ggcagaagattgcaattgctgcctgtgagcagtgtggtcgtaaccgggtgccggaaatccgtccagcgatggatctggaagc 

ctggtgtgcagagcaggatgaaggactgaaactgaatcttcacccgcgcgccagtaacagcatcaatacgttgccgttaccg 

gttgaacgcgtccgcctgctgattggcccggaaggcggtttatcggcagatgaaattgccatgactgcccgctatcaatttactg 

atatcctgttgggacctcgcgttttgcgtacagagacaactgcgctcaccgccattaccgcgctacaagtacgatttggcgatttg 

ggctaa (SEQ ID NO: 22) . , 



atgatgaatcgagtaattccgctccctgatgagcaggcaacattagacctgggcgagcgggtagcgaaagcctgcgatggc 
gcaaccgtaatctatctgtatggcgatttaggcgcaggtaaaaccacctttagccggggctttttacaggctctgggtcatcagg 

gtaatgtcaaaagccccacttatacgctggtcgaa 

ccgatcccgaggagctggagtttatggggatccgcgattattttgccaacgatgccatctgcctggtggagtggccacaaca 
ggtacaggtgttcttcctgacccggatgtcgaaatacacattgattatcaggcacaaggccgtgaggcgcgcgtgagtgcggtt 

tcctctgcgggtgaattgttgctggcgcgtttagccggttaa 
(SEQ ID NO; 23) 



atgaaattacgctctgtaacctacgcattattcattgctggcctggctgcattcagcacatcttctctggcggcacaatctttacgm 

cggttatgaaacatcacaaaccgactcgcaacatattgcggcgaaaaaattcaatgatttattgcaggagagaaccaaaggc 

gagctgaaattaaaactgttcccggacagcactetcggtaacgcgcaggcgatgatcagcggcgtacgtggcggcaccatc 

gatatggaaatgtccggctcgaataactttgccgggttatcaccagtgatgaacttgcttgatgtecctttcctgttccgcgataccg 

ctcacgcgcataaaacgctcgacggcaaagtcggtgatgatctgaaagcctcacttgaaggtaaaggactgaaagtactgg 

cctactgggaaaacggctggcgcgatgtcaccaactcgcgcgGaccggttaaaacccccgccgacctgaaagggctgaaa 

atccgcaccaacaatagcccgatgaatatcgccgcattcaaagtctttggcgctaacccgatcccgatgccgtttgccgaagtc 

tataccgggctggaaacccgcactatcgacgctcaggaacacccgatcaacgtcgtctggtcagcaaaatttttcgaagtgca 

gaagttcctttctGtgacgcaccacgcctattccccgcttctggtggtgatcaacaaagcgaagtttgatggcttaagtc^^ 

ccagcaggcgctagtttcatctgcacaagaagcgggtaactatcagcgcaaactggttgctgaagatcagcaaaaaatcatc 

gacggcatgaaagaagcgggcgtggaagtcatcaccgatctcgaccgcaaagcctttagcgacgcactggggaatcaggt 

tcgcgacatgtttgttaaagatgtgccgcagggagctgatctgctgaaagccgtggatgaggtgcaataa (SEQ ID NO: 

24) 

10)yrdC 

gtgaataataacctgcaaagagacgctatcgcagctgcgatagatgttctcaatgaagaacgtgtcatcgcctatccaacgga 
• agccgttttcggtgttgggtgcgatcctgatagcgaaacagcagtgatgcgactgttggagttaaaacagcgtccggttgataa 
ggggctgattttaatcgcagcaaattacgagcagcttaaaccctatattgatgacaccatgttgactgacgtgcagcgtgaaac 
camtttcccgctggceaggtcctgtcacctttgtctttcccgcgcctgcgacaacaccgcgctggttgacgggccgctttgattcg 
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cttgctgtacaagtcaccgaccatccgttggtggttgGtttgtgccaggcttatggtaaaccgctggtttctaccagtgccaacttga 
gtggattgccaccttgtcgaawgtagaGgaagttcgcgcacaatttggcgcggcgttcccggttgtgcctggtgaaaegggg 
gggcgtttaaatccttcagaaatccgcgatgccctgacgggtgaactgtttcgacaggggtaa (SEQ ID NO: 25) 

11) yhbC ' ' 

gtgggcttgtccacattagagcaaaaattaacagagatgattactgcgccagttgaggccqtgggttttgaactggttggcatcg 

aatttattcgcggtcgcacatccacactgcgcatctatattgatagtgaagatggcatcaatgttgatgattgtgctga'tgtgagcc 

accaggtaagtgctgtgGtggatgttgaagatcccatcaccgttgcttataacctggaagtctcctcaccgggtctcgatcgccc 

actgttcacggctgaacactacgcccgttttgteggagaagaggtgactctggttctccgtatggcggtacaaaaccgtcgtaa 

atggcagggcgttatcaaagcggtagacggfgaaatgatcacagttaccgtcgaaggtaaagatgaagtgttcgcgctgagt 

aatatccagaaggcgaaoctggttccccacttttaa (SEQ ID NO: 26) 

12) ygbP 

atggcaaccactcatttggatgtttgcgccgtggttccggcggccggatttggccgtcgaatgcaaacggaatgtcctaagcaa 
tatctctcaatcggtaatcaaaccattcttgaacactcggtgcatgcgctgctggcgcatccccgggtgaaacgtgtcgtcattgc 
cataagtcctggcgatagccgttttgcacaacttcctctggcgaatcatccgcaaatcaccgttgtagatggcggtgatgagcgt 
gccgattccgtgctggcaggtctgaaagccgctggcgacgcgcagtgggtattggtgcatgacgccgctcgtccttgtttgcatc 
aggatgacctcgcgcgattgttggcgttgagcgaaaccagccgcacgggggggatcctcgccgcaccagtgcgcgatacta 
tgaaacgtgccgaaccgggcaaaaatgccattgctcataccgttgatcgcaacggcttatggcacgcgctgacgccgcaattt 
ttccctcgtgagctgttacatgactgtctgacgcgcgctctaaatgaaggcgcgactattaccgacgaagcctcggcgctggaa 
tattgcggattccatcctcagttggtcgaaggccgtgcggataacattaaagtcacgcgcccggaagatttggcactggccga 

gttttacctcacccgaaccatccatcaggagaatacataa (SEQ ID NO: 27) 

ataag^caggtgatcctcgatttacaactggcatgtgaagataattccgggttaccggaagagagccagtttcagacatggctg. 

aatgcggtgatcccgcagtttcaggaagaatcggaagtgacgattcgcgtggtcgataccgccgaaagccacagtctgaatc 

tgacctatcgcggtaaggataagccgaccaacgtgctctccttcccgtttgaagtgccgcctggcatggaaatgtcgctactgg 

gcgatctggttatctgccgtcaggtggttgagaaggaagctcaggagcaaggcaaaccactggaggcgcactgggcgcata 

tggtggtgcacggcagtctgcatttgttaggttacgatcacatcgaagatgacgaagcagaagaaatggaagccctcgaaac 

agagattatgcttgctctgggctatgaggatccgtacattgccgagaaagaataa (SEQ ID NO: 28) 

14) gcpE 

atgcataaccaggctccaattcaacgtagaaaatcaacacgtatttacgttgggaatgtgccgattggcgatggtgctcccatc 

gccgtacagtccatgaccaatacgcgtacgacagacgtcgaagcaacggtcaatcaaatcaaggcgctggaacgcgttgg 

cgctgatatcgtccgtgtatccgtaccgacgatggacgcggcagaagcgttcaaactcatcaaacagcaggttaacgtgccg 

ctggtggctgacatccacttcgactatcgcattgcgctgaaagtagcggaatacggcgtcgattgtctgcgtattaaccctggca 

atatcggtaatgaagagcgtattcgcatggtggttgactgtgcgcgcgataaaaacattccgatccgtattggcgttaacgccgg 

atcgctggaaaaagatctgcaagaaaagtatggcgaaccgacgccgcaggcgttgctggaatctgccatgcgtcatgttgat 

catctcgatcgcctgaacttcgatcagttcaaagtcagcgtgaaagcgtctgacgtcttcctcgctgttgagtcttatcgtttgctgg 

caaaacagatcgatcagccgttgcatctggggatcaccgaagccggtggtgcgcgcagcggggcagtaaaatccgccattg 

gtttaggtctgctgctgtctgaaggcatcggcgacacgctgcgcgtatcgctggcggccgatccggtcgaagagatcaaagtc 

ggtttcgatattttgaaatcgctgcgtatccgttcgcgagggatcaacttcatcgcctgcccgacctgttcgcgtcaggaatttgatg 

ttatcggtacggttaacgcgctggagcaacgcctggaagatatcatcactccgatggacgtttcgattatcggctgcgtggtgaa 

tggcccaggtgaggcgctggtttctacactcggcgtcaccggcggcaacaagaaaagcggcctctatgaagatggcgtgcg 

caaagaccgtctggacaacaacgatatgatcgaccagctggaagcacgcattcgtgcgaaagccagtcagctggacgaa 

gcgcgtcgaattgacgttcagcaggttgaaaaataa 
(SEQ ID NO: 29) 

15) kdtB 

' atgcaaaaacgggcgatttatccgggtactttcgatcccattaccaatggtcatatcgatatcgtgacgcgcgccacgcagatgt 
tcgatcacgttattctggcgattgccgccagccccagtaaaaaaccgatgtttaccctggaagagcgtgtggcactggcacag 
caggcaaccgcgcatctggggaacgtggaagtggtcgggtttagtgatttaatggcgaacttcgcccgtaatcaacacgctac 
ggtgctgattcgtggcctgcgtgcggtggcagattttgaatatgaaatgcagctggcgcatatgaatcgccacttaatgccggaa 
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ctggaaagtgtgtttctgatgccgtcgaaagagtggtcgtttatctcttcatcgttggtgaaagaggtggcgcgccatcagggcg 
atgtcacccatttcctgccggagaatgtccatcaggcgctgatggcgaagttagcgtag (SEQ ID NO: 30) 

16) pfs 

atgaaaatcggcatcattggtgcaatggaagaagaagttacgctgctgcgtgacaaaatcgaaaaccgtcaaactatcagtc 

tcggcggttgcgaaatctataccggccaactgaatggaaccgaggttgcgcttctgaaatcgggcatcggtaaagtcgctgcg 

gcgctgggtgccactttgctgttggaacactgcaagccagatgtgattattaacaccggttctgccggtggcctggcaccaacgt 

tgaaagtgggcgatatcgttgtctcggacgaagcacgttatcacgacgcggatgtcacggcatttggttatgaatacggtcagtt 

accaggctgtccggcaggctttaaagctgacgataaactgatcgctgccgctgaggcctgcattgccgaactgaatcttaacg 

ctgtacgtggcctgattgttagcggcgacgctttcatcaacggttctgttggtctggcgaaaatccgccacaacttcccacaggc 

cattgctgtagagatggaagcgacggcaatcgcccatgtctgccacaatttcaacgtcccgtttgttgtcgtacgcgccatctcc 

gacgtggccgatcaacagtctcatcttagcttcgatgagttcctggctgttgccgctaaacagtccagcctgatggttgagtcact 

ggtgcagaaacttgcacatggctaa (SEQ ID NO: 31) 

17) yea J 

gtgagcaatctgtcgctcgatttttcggataatacttttcaacctctggccgcgcgtatgcggccagaaaatttagcacagtatatc 

ggccagcaacatttgctggctgcggggaagccgttgccgcgcgctatcgaagccgggcatttacattctatgatcctctggggg 

ccgccgggtaccggcaaaacaactctcgctgaagtgattgcccgctatgcgaacgctgatgtggaacgtatttctgccgtcac 

ctctggcgtgaaagagattcgcgaggcgatcgagcgcgcccggcaaaaccgcaatgcaggtcgccgcactattctttttgttg 

acgaagttcaccgtttcaacaaaagccagcaggatgcatttctgccacatattgaagacggcaccatcacttttattggcgcaa 

ccactgaaaacccgtcgtttgagcttaattcggcactgcmcccgtgcccgtgtctatctgttgaaatccctgagtacagag 

ttgagcaagtactaactcaggcgatggaagacaaaacccgtggctatggtggtcaggatattgttctgccagatgaaacacg 

acgcgccattgctgaactggtgaatggcgacgcgcgccgggcgttaaatacgctggaaatgatggcggatatggccgaagt 

cgatgatagcggtaagcgggtcctgaagcctgaattactgaccgaaatcgccggtgaacgtagcgcccgctttgataacaaa 

ggcgatcgcttttacgatctgatttccgcactgcataagtcggtacgtggtagcgcacccgatgcggcgctgtactggtatgcgc 

gaattattaccgttggtggcgatccgttatatgtcgcgcgtcgctgtctggcgattgcgtctgaagacgtcggt^ 

cgggcgatgcaggtggcaattgcggcctgggattgctttactcgcgttggcccggcggaaggtgaacgcgccattgctcaggc 

gattgtttacctggcctgcgcgccaaaaagcaacgctgtctacactgcgtttaaagccgcgctggccgatgctcgcgaacgcc 

cggattatgacgtgccggttcatttgcgtaatgcgccgacgaaattaatgaaggaaatgggctacgggcaggaatatcgttac 

gctcatgatgaagcaaacgcttatgctgccggtgaggtttacttcccgccggaaatagcacaaacacgctattatttcccgaca 

aacaggggccttgaaggcaagattggcgaaaagctcgcctggctggctgaacaggatcaaaatagccccataaaacgcta 

ccgttaa (SEQ ID NO: 32) 

18) b1808 . ' 
gtgacggacgattttgcaccagacggtcagctggcgaaagcgataccaggctttaagccgcgagaaccacagcgacagat 

ggcggtagccgtcacccaggcgatagaaaaaggccagccgctggtggtggaagcaggaaccggtacgggcaaaaccta 
cgcttacctggctcctgcgctgcgggcgaaaaaga 

gccgcgatttgccaacagtctcaaaggcattgaaatatacgggcaacgtggcgctgctgaaagggcgctcaaactacctctg 
cctcgaacgtcicgaacagcaggcgctggcggggggcgatctgccggtacaaatcttaagcgatgtgatcctgctgcg^ 
ggtctaatcaaacagtcgatggtgatatcagcacctgcgtcagcgtggcggaagattcacaggcgtggccgctggtcaccag 
caccaacgacaactgtcttggcagcgactgcccgatgtataaagattgctttgtggtcaaagcacgtaaaaaagcgatggac 
gccgatgtggtggtggtaaaccatcatctctttctggcggatatggtggttaaagagagtggatttggcgaactgatcccggaag 
cggacgtcatgatcttcgacgaagcccaccagctaccggacattgccagccagtattttggtcagtcactctccagtcgacaac 
tgctcgacctggcaaaagacatcaccatcgcctaccgcaccgaattaaaagacacccagcagttacaaaagtgcgctgatc 
gtcttgcccagagtgcgcaggamtcgtctgcaactcggtgagccaggttatcgcggtaacctgcgtgagctgttagctaa^ 
. caaattcagcgggcatttttactgctcgatgacaccctggaactttgttatgacgtggcgaaactgtcactggggcgttccgccttg 
ctggatgcggcatttgagcgcgccacgttgtatcgcaca.cggctgaagcggctaaaagagatcaatcagccgggctacagct 
actggtacgaatgcacttcgcgccattttactctggctctcacgccgctcagcgtggcggataaattcaaagagttaatggcgca 
aaaacGcggtagctggatcttcacctcagcaacgctgtcggtgaacgacgatctgcatcatttcacctcgcggcttggcatcga 
araggccgagtcgttgctgttgcccagcccamg 

. aaccagccaggttctgctcgccagttagcggcaatgctgcgaccgatcatcgaagctaacaacggtcgttgttttatgctttgtac 
ctcgcacgccatgatgcgcgatctggccgagcagttccgcgctaccatgacgcttcctgtattgttgcagggggaaaccagca 
aagggcaactgttgcagcaatttgtcagcgccggtaatgcgcttcttgtggcaaccagcagtttctgggaaggggtggacgtgc 
gtggcgatacattgtcattggtaattatcgacaaattgccgtttacctcgccggatgatccactgttaaaagcgcgcatggaagat 
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tgtcgtttgcgcggtggcgacccgttcgatgaagtgcaactaccagatgccgtcattactctcaaacagggggtagggcgact 
gattcgcgacgccgacgatcgtggcgtgctggtgatttgtgacaatcggctggtgatgcgtccttacggcgcgacgtttctcgcc 
agtctgccgcccgcgccacgcacccgtgacattgcccgtgcggttcgtttccttgcgataccatcctccaggtaa (SEQ ID 

NO: 33) 

19) yeaA 

atggctaataaaccttcggcagaagaactgaaaaaaaatttgtccgagatgcagttttacgtgacgcagaatcatgggacag 
aaccgccatttacgggtcgtttactgcataacaagcgtgacggcgtatatcactgtttgatctgcgatgccccgctgtttcattcc^ 
aaaecaagtatgattccggctgtggctggcccagtttctacgaaccggtaagtgaagaatccattcgttatatcaaagacttgtc 
acatggaatgcagcgcatagaaattcgttgcggtaactgtgatgcccatctggggcatgttpttccccgacgggccgcagccaa 
cgggcgaacgttattgtgttaactctgcctctttacgctttaccgatggcgaaaacggcgaagaaatcaacggttga (SEQ 

ID NO: 34) 

20) yagF ' 
atgaccattgagaaaattttcaccccgcaggacgacgcgttttatgcggtgatcacccacgcggcggggccgcagggcgctc 

tgccgctgaccccgcagatgctgatggaatctcccagcggcaacctgttcggcatgacgcagaacgccgggatgggctggg 

acgccaacaagctcaccggcaaagaggtgetgattatcggcactcagggcggcatccgcgccggagacggacgcccaat 

cgcgctgggctaccacaccgggcattgggagatcggcatgcagatgcaggcggcggcgaaggagatcacccgcaatggc 

gggatcccgttcgcggccttcgtcagcgatccgtgcgacgggcgctcgcagggcacgcacggtatgttcgattccctgccgta 

ccgcaacgacgcggcgatcgtgtttcgccgcctgatccgctccctgccgacgcggcgggcggtgatcggcgtagcgacctg 

cgataaagggctgcccgccaccatgattgcgctggccgcgatgcacgacctgccgactattctggtgccgggcggggcgac 

gctgccgccgaccgtcggggaagacgcgggcaaggtgcagaccatcggcgcgcgtttcgccaaccacgaactctccctgc 

aggaggccgccgaactgggctgtcgcgcctgcgcctcgccgggGggcgggtgtcagttcctcggcacggcgggcacctcg 

caggtggtcgcggaggcgctgggtctggcgctgccgcactccgcgctggcgccgtccgggcaggcggtgtggctggagatc 

gcccgccagtcggcgcgcgcggtc^gcgagctggatagccgcggcatcaccacgcgggatatcctctccgataaagc^ 

cgaaaacgcgatggtgatccacgcggcgttcggcggctc^ 

ggctgcacgatcccggacgttgagGactggacgcgcatcaaccgtaaagtgccgcgtctggtgagcgtgctgcccaacggc 

ccggactatcacccgaccgtgcgcgccttcctcgcgggcggcgtgccggaggtgatgctccacctgcgcgacctcggc^ 

tgcatctggacgccatgaccgtgaccggccagacggtgggcgagaaccttgaatggtggcaggcgtccgagcgccgggcg 

cgcttccgccagtgcctgcgcgagcaggacggcgtagagccggatgacgtgatcctgccgccggagaaggcaaaagcga 

aagggctgacctcgacggtctgcttcccgacgggcaacatcgctccggaaggttcggtgatcaaggccacggcgatcgacc 

cgtcggtggtgggcgaagatggcgtataccaccacaccggccgggtgcgggtgtttgtctcggaagcgcaggcgatcaagg 

cgatcaagcgggaagagattgtgcagggcgatatcatggtggtgatcggcggcgggccgtccggcaccggcatggaagag 

acctaccagctcacctccgcgctaaagcatatctcgtggggcaagacggtgtcgctcatcaccgatgcgcgcttctcggg^ 

gtcgacgggcgcctgcttcggccacgtgtcgccggaggcgctggcgggcgggccgattggcaagctgcgcgataacgaca 

tcatcgagattgccgtggatcgtctgacgttaactggcagcgtgaacttcatcggcaccgcggacaacccgctgacgccgga 

agagggcgcgcgcgagctggcgcggcggcagacgcacccggacctgcacgcccacgactttttgccggacgacacccg 

gctgtgggcggcactgcagtcggtgagcggcggcacctggaaaggctgtatttatgacaccgataaaattatcgaggtaatta 

acgccggtaaaaaagcgctcggaatttaa (SEQ ID NO: 35) 

21) b1983 

gtgggacgtaaatgggccaatattgttgctaaaaaaacggctaaagacggtgcaacgtcta'aaatttatgcaaaattcggtgta 
gaaatctatgctgctgctaaacaaggtgaacccgatccagaattaaacacatctttaaaattcgttattgaacgtgcaaagcag 
gcacaagttccaaagcacgttattgataaagcaattgataaagccaaaggcggcggagatgaaacgttcgtgcagggacgt 
tatgaaggctttggtcctaatggctcaatgattatcgcc'gagacattg^ 

attttcaataaaaaaggcggcaatatcggagcggcaggttctgtcagctatatgtttgacaatacgggtgtgattgtatttaaagg 
gacagaccctgaccatatttttgaaattttacttgaagctgaagttgatgttcgtgatgtgactgaagaagaaggtaacattgttatt 
tatactgaacctactgaccttcataaaggaatcgcggctctaaaagcagctggaatcactgagttctcaacaacagaattaga 

aatgattgctcaatctgaagttgagctttccccag 
atgtacaaaaagtttatcataacgtcgcaaatctctaa (SEQ iD NO: 36) 

22) yidD 

atggcgccgccactgtcgcctggctcgcgggtcctgatagccctcattcgggtctatcaacgcctgattagtccgctactcgggc 
cgcattgtcgtttc'actccaacctgttcaagctacggaattga'ggcattgcgcaggtttggagtgataaaaggcagttggttgacg 



Fig. 1 continued 



WO 00/61793 PCT/EP00/03135 

6/15 

gtgaaacgcgtattaaaatgccaccctttacaccctggtggtgacgatcccgtcccgcccggaccatttgataccagagaaca 
ctaa (SEQ ID NO: 37) 



23) yceG . 
atgaaaaaagtgttattgataatcttgttattgctggtggtart^ 

ccgacagcaaattgcttatcaaagaagagacgatatttaccctgaagccagggaccggacgtctggcgctcggtgaacagc 

tttatgccgataagatcatcaatcgtccacgggtttttcaatggctgctgcgtatcgaaccggatctttctcactttaaagccgg 

ttaccgctttacaccgcagatgaccgtgcgcgagatgctgaaattgctggaaagcggtaaagaagcacagte 

tggtagaagggatgcgtctgagcgattacctcaagcaattgcgtgaggccccgtatatcaagcatacgctgagcgatgataa 

tacgccaccgtagcgcaggcacttgaactggaaaacccggagtggattgaaggttggttctggccagacacctggatgtata , 

ccgccaataccaccgatgtcgcgttaetcaagcgagcgcacaagaaaatggtgaaagcggtcgatagcgcctgggaagg 

gcgtgcggacggtctgccttataaagataaaaaccagttggtgacgatggcatcaattatcgaaaaagaaaccgccgttgcc 

agtgaacgcgataaggttgcctcagtatttatcaaccgtttacgcattggtatgcgcctgcagaccgacccgaccgtgatttacg 

"ggatgggagagcgttataatggcaaactttctcgtgcagacctggaaacgccgacagcgtataacacctataccattaccggt 

ctgccgccaggtgcgatagcgacgccgggggcggattcgctgaaggctgctgcgcatccggcaaaaacgccgtatctctatt 

ttgtggccgatggtaaaggtggtcacacgmaafaccaatcttgccagtcataacaagtctgtgcaggattatctgaaagtgctt 

aaggaaaaaaatgcgcagtaa 

(SEQ ID NO:38) . ~ ■ 

24) yjbC ' 

atgctgcccgactcatcagtccgtttaaataaatacatcagcgaaagcggaatttgctcacgccgcgaagcggatcgctatatc 

gagcaaggcaatgtgttccttaatggcaagcgagccaccattggcgatcaggtgaaacccggcgacgttgtgaaagtaaac 

ggtcagttgattgaacctcgggaagccgaagatttggtacttatcgccctgaacaagcccgttggtattgtaagcaccaccgaa 

gatggcgagcgcgataacattgtcgatttcgttaaccacagcaaacgcgtgttcccgattggccgcctggataaagactccca 

ggggctgattttcctcaccaatcacggcgatctggtgaataagatcctgcgtgctggcaatgatcatgagaaagagtatctggt^ 

acggtcgataaaccgattaccgaggagtttattcgcggcatgagtgcgggggtgccaatcctcgggacagtgaccaaaaagt 

gcaaagttaaaaaagaagcgccgtttgtcttccgcattaccctggtgcaggggctgaaccgtcagatccggcgcatgtgcga 

gcatttcggctatgaagtgaaaaagctggaacgcacgcgcatcatgaacgttagcttaagcggcattccgctgggggaatgg 

cgcgatttaaccgacgatgagttaatcgacctctttaagctcattgaaaattcctcttccgaggtaaaacctaaagcgaaggcc 

aaaccgaaaacagcgggcatcaaacgtccagtcgttaagatggaaaaaacggcggaaaaaggcggtcgcccggcgtcc 

aacggtaagcgttttacctcgccggggcgtaaaaagaaggggcgctga (SEQ IDNO:39) 
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SEQUENCE .LISTING 

<110> GPC Biotech AG 

<120> Novel method for ■ identifying antibacterial compounds 

<130> D 1400 PCT 

<140> 
<141> 

<160> 45 ■ 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 39 - 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description'of Artificial Sequence: artificial 
sequence 

<400> 1 

atggttataa atttggagtg tgaaggttat tgcgtgtaa 



<210> 2 
<211> 33 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 2 

gttataaatt tggagtgtga aggttattgc gtg J 



<210> 3 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 3" 

cacgcaataa ccttcacact ccaaatttat aac 



<210> 4 
<211> 36 
<212> DNA 

<213> Artificial- Sequence 
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<220> . 
<223> Description of Artificial Sequence: artificial 
sequence ■ ~ 

<400> 4 

gatcggatcc aaattccagt tagccatgat gcggtc 



<210> 5 
<211> 54 
<212> DNA 

<213> Artificial Sequence 
, <220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 5 - - • 

cacgcaataa cctrcacact' ccaaatttat aaccattata cacggacgct atgc 



<400> 7 

gatcgtcgac gcgctcgata tcaccgatga acaaccg 



<210> 8 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 



36 



54 



<210> 6 * . .... 

<211> 55 
<212> DNA 

<213> Artificial Sequence 

<220> > . 

<223> Description of Artificial Sequence: artificial 
sequence - - 

<400> 6 

gttataaatt tggagtgtga aggttattgc gtgacggatt aattttgttt - ctctt- 5b 



<210> 7 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence;, artificial 
sequence 



37 



<400> 8 . 

caatccgctg ctttatttct gtcag 



25 



WO 00/61793 



3 
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<210> 9 ' 
.<211> 25 

<212> DNA ^ ' 

<213> Artificial Sequence ' ' " 

<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 9 • 

ttataacgaa atcaacggga aacct ^ 



<210> 10 . > / 

<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> • 

<223> Description of' Artificial Sequence: artificial 
sequence 

<400> 10 . 
gatcggatcc ctctaaaaaa taaggaatta aagg 



<210> 11 

<211> 56 - 
< 2 1 2 > DNA 

<213> Artificial Sequence 
<220> 

<223> Description' of. Artificial Sequence: artificial 
sequence 

<400> 11 

cacgcaataa ccttcacact ccaaatttat aaccatagga tacctaatta attaac 



<210>12 
<211> 54 
<212> DNA 

<213> Artificial Sequence . 
<220> 

<223> Description of Artificial Sequence: artificial 
sequence 

<400> 12 

gttataaatt tggagtgtga aggttattgc gtgaagagcg ccatttccca ccgt 



<210> 13 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



25 



34 



56 



54 



<220> 
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<223> Description cf Artificial . Sequence : artificial 
sequence 

<400> 13 ■ 
gatcgtcgac tcataztcct gataacccgc tgcggt 36 f 



<210> 14 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> , 
<223> Description cf Artificial Sequence artificial 
sequence 

<400> 14 24 
gttgacggcc aggccaacag teat 



<210> 15 
<211> 22 
<2-l2> DNA 

<213> Artificial Sequence 

<220> - . 1 

<223> Description of Artificial Sequence: artificial 

. sequence 



<4 00> 15 

ataaccctgg gcaatcgcct eg 



<210> 16 
<211> 480 
<212> DNA , 

<213> Escherichia coli 



22 



<400> 16 

atgcgaattg gacacggttt tgaegtacat gcctttggcg gtgaaggece aattatcatt 60 

ggtggcgtac gcattcctta cgaaaaagga ttgctggcgc attctgatgg cgacgtggcg 120 - 

ctccatgcgt tgaccgatgc attgettgge gcggcggcgc tgggggatat eggcaagctg 180 

ttcceggata ccgatccggc atttaaaggt gccgatagcc gegagctget acgcgaagcc 240 

tggcgtcgta ttcaggcaaa gggttatacc cttggcaacg tcgatgtcac tatcatcget 300 

caggcaccga agatgttgcc gcacattcca caaatgegeg tgtttattgc cgaagatctc 360 

ggctgecata tggatgatgt taacgtgaaa gccactacta eggaaaaact gggatttacc 420 
ggacgtgggg aagggattgc ctgtgaagcg gtggcgctac tcattaaggc aacaaaatga 480 

<210> 17 

<211> 531 ' 
<212> DNA 

<213> Escherichia coli 

<400> 17 ' 

atgcgccgcg cttttataac eggagtttte tttttgtctg aagtcgaatt tagecacgaa 60 
"tactggatgc gtcacgcact gacgctggcg aaacgtgcct gggatgagcg ggaagtgccg 120. 
gteggegegg tattaatgea taacaategg gtaatcggcg aaggctggaa ccgcccgatt 180 
ggtcgccatg atcccaccgc acatgeagaa atcatggccc tgcggcaggg tggtctggtg 240 
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5 

atgcaaaatt atcgtctgat cgacgccacg ttgtatgtca cgcttgaacc atgtgtaatg 300 
tgtgccggag cgatgatcca cagtcgcatt ggtcgcgtgg tcrttggtgc gcgtgacgcg 360 
aaaactggcg ctgcgggatc tttaatggat gtgctgcatc atccgggtat gaatcaccga 420 
gtggaaatta cggaaggaat actggcggat gagtgcgcgg cgttgctcag -tgacttcttt ■ 4 80 
cgcatgcgcc gccaggaaat taaagcgcag aaaaaagcgc aatcctcgac ggattaa 537-., 



<210> 18 
<211> 621 
<212> DNA 

<213> Escherichia coli 



<400> 18 

atgaggtata 

gcgtttgctg 

gaaccaggtg 

gctgatggaa 

aaaaactggc 

cagcaagcta 

tataaaaaag 

accatgcagc 

acgcgcgaag 

gctatcgcat 

gtctcacagg 



tagttgcctt 
atctcggaat 
cacctgcgct 
cattgcagcg 
ttaacgccct 
cttcccccta 
cgaatcgagt 
gcgatgatgt 
cccgccttgc 
cggatgttgc 
aaaaaccgta 



aacgggaggc 
taacgtcatt 
acatgccatt 
ccgggccttg 
gctgcatccg 
tgtactgtgg 
gcttgtggtg 
aactcgcgag 
cgtggcagat 
ccgcctgcac 
a 



attggcagtg 
gatgccgata 
gctgatcact 
cgcgagcgga 
ctgat-tcagc 
gttgtgccat 
gatgtcagcc 
catgtcgaac 
gacgtcattg 
gcacactatt 



gcaagagtac 
ttattgcgcg 
ttggcgctaa 
tcttcgccaa 
aagagacgca 
tgctggtaga 
cagaaacgca 
aaatccttgc 
ataataacgg 
tgcagcttgc 



cgttgccaat 60 
tcaggtggtt 120 
catgattgct 180 
cccggaagag .240 
acaccagatc 300 
aaactcactg 360 
acttaagcgc 420 
tgctcaggca 480 
cgcaccggat 540 
gtcgcagttt 600 
621 



<210> 19 
<211> 852 
<212> DNA 

<213> Escherichia coli 



■<4 00> 19 
atgcggacac 
cgtgcggatg 
atcagcattg 
gaacatgaag 
"agcgggcgtc 
ggcggcggtc 
ctctggcaat 
gatgttcctg 
acgccggtgg 
actccggtga 
acgttgctaa 
gaggttgatg 
ggggcctgtg 
gccccggaat 
gccatgcttt 



agtggccctc 
gttaccacac 
agcttcgtga 
ataacctgat 
ttccgacggc 
tcggcggtgg 
gcgggctaag 
tctttgttcg 
atccgccaga 
tttttaaaga 
aatgtgaatt 
cggtgctttc 
tctttgctga 
ggctcaatgg 
aa 



tccggcaaaa 
gctgcaaacg 
cgatggggat 
cgttcgcgca 
aagcggtgcg 
.ttcatccaat 
catggatgag 
ggggcatgcc 
gaagtggtat 
tcctgaactc 
cagcaatgat 
ctggctgtta 
atttgataca 
ctttgtggcg 



cttaatctgt 
ctgtttcagt 
attcgtctgt 
gcgcgattgt 
aatatcagca 
gccgcgacgg 
ctggcggaaa 
gcgtttgccg 
ctggtggcgc 
ccgcgcaata 
tgcgaggtta 
gaatacgccc 
gagtctgaag 
aaaggcgcta 



ttttatacat 
ttcttgatta 
taacgcccgt 
tgatgaaaac 
.ttgacaagcg 
tcctggtggc 
tggggctgac 
aaggcgttgg 
accctggtgt 
cgccaaaaag 
tcgcaagaaa 
cgtcgcgcct 
cccgccaggt 
atctttcccc 



taccggtcag 60 
cggcgacacc 120 
tgaaggcgtg 180 
tgcggcagac 240 
tttgccgatg 300 
attaaatcat -360 
gctgggcgca 420 
tgaaatacta 480 
aagtattccg 540 
gtcaatagaa 600 
acgttttcgc 660 
gactgggaca 720 
gctagagcaa 780 
attgcacaga 840 
852 



<210> 20 

<211> 696- 

<212> DNA • 

<213> Escherichia coli 



<400> 20 

atgcgacttg ataaatttat cgcacagcaa ctcggcgtta gccgtgctat tgccgggcgt 60 
gaaatccgcg gcaatcgtgt caccgtcgat ggcgaaatcg tccgtaatgc agcgttcaaa 120 
ctgcttcctg aacatgatgt cgcttacgat ggcaacccgc tggcgcagca acacggtcca 180 
cgttacttca tgctcaataa gcctcagggc tatgtttgct ccacggacga ccctgatcac 240 
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ccaacggtgc tctaztttct tgatgaaccg gtagcgtgga aactgcatgc ggcggggcgg 300 

ttggatattg ataccaccgg tctggtgctg atgactgatg atggtcagtg ctcgcaccgc 360 

attacttctc cgcgccatca ttgcgagaag acctatctgg tgacactgga atcacctgta 4.20 

gctgacgata cggcagagca atttgctaaa ggcgtgcagc tgcataacga aaaagatctc ' 4 80 

actaagcctg cggtgctgga agtgattacc ccaacgcagg tt'cgtctgac catcagcgaa 54 0- 

gggcgttatc atcaggtgaa acgcatgttc gccgccgtgg gtaaccacgt ggttgagctg 600 

catcgtgaac gtattggcgg. tattacgctg gatgctgatt tagcccccgg. tgaatatcgt 660 
ccgttaactg aagaagaaat tgccagcgtc gtctaa 696. 



<210> 21 
<211> 885 
<212> DNA 

<213> Escherichia coli 



<400> 21 

atgattatgc cgcaacatga ccaattacat cgctatctgt ttgaaaactt. tgccgtgcgc 60 
ggcgaactgg taaccgtttc ggaaaccctg caacagatcc ttgagaacca cgattatccg 120 
cagcccgtta aaaacgtgct ggcagaactg ctggttgcga ccagcctgtt aaccgctacg 180 
ctqaaqtttg atggtgatat caccgtacag ctgcagggcg acggtccgat gaatctggcg 240" 



gttattaacg'gtaacaataa ccagcagatg cgcggtgtgg cgcgcgtgca gggcgaaatt 300 
ccagaaaatg ccgacctgaa aacgctggtc .ggcaatggtt acgtggtgat caccattacc 
ccgagcgaag gcgaacgcta tcagggcgta gttggtctgg aaggtgatac cctggcggcc 420 



360 



tgcctggaag attactttat gcgttctgaa cagctgccga cgcgcctgtt tattcgcacc 480 

ggcgacgtagacggcaaacc ggctgcaggc ggtatgttgt tgcaggtaat gcctgcgcaa 540 

aatgcccagc aggacgactt tgaccacctg gcgacgctaa ccgaaaccat caaaaccgaa 600 

gaactgctga ccttaccggc aaacgaagtg ttgtggcgtt tgtatcacga agaagaggtg 660 

acggtttacg atccgcagga tgtggagttc aaatgcacct gctcgcgtga acgttgcgcc 720 

gatgcgctga aaacgctgcc tgatgaagaa gttgatagca ■ tcctggcgga agatggcgaa 780 

attgacatgc attgtgatta ctgcggtaac cactatctgt tcaatgcgat ggatattgct 840 

gaaatccgca acaacgcgtc tccggcagat ccgcaagttc attaa 885 



<210> 22 
<211> 759 
< 2 1 2 > DNA 

<213> Escherichia coli 
<400> 22 

gtggggagac gacgcggatt tttaactatg 
ctgaccagcc attctca'cat cgcgctttgc 
ctgcgcatgg ggccggggca ggcgttgcaa 
gccgaaatta ccagcgccag caaaaaaagc 
gacgatcgcg aatctccgct gcatattcac 
atggaattta ctatccagaa atcgatcgaa 
tctgagcgct gcggcgttaa actggatagt 
cagaagattg caattgctgc ctgtgagcag 
ccagcgatgg atctggaagc ctggtgtgca 
cacccgcgcg ccagtaacag catcaatacg 
ctgattggcc cggaaggcgg tttatcggca 
tttactgata tcctgttggg acctcgcgtt 
attaccgcgc tacaagtacg atttggcgat 



<210> 23 
<211> 462 
<212> DNA 

<213> Escherichia coli 



cgtatccccc gcatttatca tcctgaacca 60 
gaagatgccg ccaaccatat cgggcgcgta 120 
ttgtttgacg gtagcaacca ggtctttgac 180 
gtggaagtga aggtgctgga aggccagatc 240 
ctcggtcagg tgatgtcgcg tggtgaaaaa 300 
ctcggtgtaa ' gcctcattac. gccacttttt 360 
gaacgtctga acaagaagct tcagcagtgg 420 
tgtggtcgta accgggtgcc ggaaatccgt 4 80 
gagcaggatg aaggactgaa actgaatctt 540 
ttgccgttac cggttgaacg cgtccgcctg 600 
gatgaaattg ccatgactgc ccgctatcaa 660. 
ttgcgtacag agacaactgc gctcaccgcc 7-20 
ttgggctaa ^59 
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<400> 23 - 
atgatgaatc 
gtagcgaaag 
aaaaccacct 
cccacttata 
ttgtaccgcc 
aacgatgcca 
gatgtcgaaa 
tcctctgcgg 



gagtaattcc 
cctgcgatgg 
ttagccgggg 
cgctggrcga 
ttgccgatcc 
tctgcctggt 
tacacattga 
gtgaattgtx 



gctccctgat 
cgcaaccgta 
ctttttacag 
accctatacg 
cgaggagctg 
ggagtggcca 
ttatcaggca 
gctggcgcgt 



gagcaggcaa 
atctatctgt 
gctctgggtc 
ctcgacaact 
gagtttatgg 
caacaaggta 
caaggccgtg 
ttacccggtt 



cattagacct 
atggcgattt 
atcagggtaa 
taatggtcta 
ggatccgcga 
caggtgttct 
aggcgcgcgt 



gggcgagcgg 60 
aggcgcaggt 120 
tgtcaaaagc 180 
tcactttgat 240- 
ttattttgcc 300 
tcctgacccg 360 
gagtgcggtt 420 
462 



<210> 24 
<21I> 987 
<212> DNA 

<213> Escherichia coli 



,<400> 24 
atgaaattac 
tcttctctgg- 
catattgcgg 
ttaaaactgt 
ggcggcacca 
atgaacttgc 
gacggcaaag 
gcctactggg 
gccgacctga 
aaagtctttg 
acccgcacta 
gaagtgcaga 
aacaaagcga 
caagaagcgg 
ggcatgaaag 
gcactgggga 
ctgaaagccg 



gctctgtaac 
-cggcacaatc 
cgaaaaaatt- 
tcccggacag 
tcgatatgga 
ttgatgtccc 
tcggtgatga 
aaaacggctg 
aagggctgaa 
gcgctaaccc 
tcgacgctca 
agttcctttc 
agtttgatgg 
gtaactatca 
aagcgggcgt 
atcaggttcg 
tggatgaggt 



ctacgcatta 
tttacgtttc 
caatgattta 
cactctcggt 
aatgtccggc 
tttcctgttc 
tctgaaagcc 
gcgcgatgtc 
aatccgcacc 
gatcccgatg 
ggaacacccg 
tctgacgcac 
cttaagtccg 
gcgcaaactg 
ggaagtcatc 
cgacatgttt 
gcaataa 



ttcattgctg 
ggttatgaaa 
ttgcaggaga 
aacgcgcagg 
tegaataact 
cgcgataccg 
tcacttgaag 
accaactcgc 
aacaatagcc 
ccgtttgccg 
atcaacgtcg 
cacgcctatt 
gagttccagc 
gttgctgaag 
accgatctcg 
gttaaagatg 



.gcctggctgc 
catcacaaac 
gaaccaaagg 
cgatgatcag 
ttgccgggtt 
ctcacgcgca 
gtaaaggact 
gcgcaccggt 
cgatgaatat 
aagtctatac 
tctggtcagc 
ccccgcttct 
aggcgctagt 
atcagcaaaa 
accgcaaagc 
tgccgcaggg 



atrcagcaca 60 
cgactcgcaa 120 
cgagctgaaa 180 
cggcgtacgt 24.0 
atcaccagtg 300 
taaaacgctc 360 
gaaagtactg 420 
taaaaccccc 480 
cgccgcattc 540 
cgggctggaa 600 
aaaatttttc 660 
ggtggtgatc 720 
ttcatctgca 780 
aatcatcgac 840 
ctttagcgac 900 
agctgatctg 960 
987 



<210> 25 
<211> 573 
<212> DNA 

<213> Escherichia coli 



<400> 25 

gtgaataata 

cgtgtcatcg 

acagcagtga 

atcgcagcaa 

cagcgtgaaa 

gcgacaacac 

catccgttgg 

aacttgagtg 

gcgttcccgg 

gccctgacgg 



acctgcaaag 
cctatccaac 
tgcgactgtt 
attacgagca 
ccattttttc 
cgcgctggtt 
tggttgcttt 
gattgccacc 
ttgtgcctgg 
gtgaactgtt 



agacgctatc 
ggaagccgtt 
ggagttaaaa 
gcttaaaccc 
ccgctggcca 
gacgggccgc 
gtgccaggct 
ttgtcgaaca 
tgaaacgggg 
tcgacagggg 



gcagctgcga 
ttcggtgttg 
cagcgtccgg 
tatattgatg 
ggtcctgtca 
tttgattcgc 
tatggtaaac 
gtagacgaag 
gggcgtttaa 
taa 



tagatgttct 
ggtgcgatcc 
ttgataaggg 
acaccatgtt 
cctttgtctt 
ttgctgtacg 
cgctggtttc 
ttcgcgcaca 
atccttcaga 



caatgaagaa 60 
tgatagcgaa 120 
gctgatttta 180 
gactgacgtg 240 
tcccgcgcct 300 
agtcaccgac 360 
taccagtgcc 420 
atttggcgcg 480 
aatccgcgat 540 
573 



<210> 26 
<2li>' 459 
<212> DNA 

<213> Escherichia coii 
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<400> 26 

gtgggcttgt 

ctgggttttg 

tatattgata 

agtgctgtgc 

ccgggtctcg 

gtgactctgg 

■gcggtagacg 

agtaatatcc 



ccacattaga 
aactggttgg 
gtgaagatgg 
tggatgttga 
atcgcccact 
ttctccgtat 
gtgaaatgat 
agaaggcgaa 



gcaaaaatta 
catcgaattt 
catcaatgtt. 
agatcccatc 
gttcacggct 
ggcggtacaa 
cacagttacc 
cctggttccc 



8. 

acagagatga 
attcgcggtc 
gatgattgtg 
accgttgctt 
gaacactacg 
aaccgtcgta 
gtcgaaggta 
cacttttaa 



ttactgcgcc 
gcacatccac 
ctgatgtgag 
ataacctgga 
cccgttttgt 
aatggcaggg 
aagatgaagt 



agttgaggcc 60 
actgcgcatc 120 
ccaecaggta 180 
agtctcctca 240 : 
cggagaagag . 300 
cgttatcaaa 360 
gttcgcgctg 420 
459 



<210> 27 
<211> 711 
<212> DNA 

<213> Escherichia coli 



<400> 27 
atggcaacca 
atgcaaacgg 
tcggtgcatg 
ggcgatagcc 
ggcggtgatg 
tgggtattgg 
tt'ggcgttga 
. atgaaacgtg 
tggcacgcgc 
gctctaaatg 
"catcctcagt 
gcactggccg 



ctcatttgga 
aatgtcctaa 
cgctgctggc 
gttttgcaca- 
agcgtgccga 
tgcatgacgc 
gcgaaaccag 
ccgaaccggg 
tgacgccgca 
aaggcgcgac 
tggtcg.aagg 
agttttacct 



tgtttgcgcc 
gcaatatctc 
gcatccccgg 
acttcctctg 
ttccgtgctg 
cgctcgtcct 
ccgcacgggg 
caaaaatgcc 
atttttccct 
tattaccgac 
ccgtgcggat 
cacccgaacc 



gtggttccgg 
tcaatcggta 
gtgaaacgtg 
gcgaatcatc 
gcaggtctga 
"tgtttgcatc 
gggatcctcg 
attgctcata 
cgtgagctgt 
gaagcctcgg 
aacattaaag 
atccatcagg 



cggccggatt 
atcaaaccat 
tcgtcattgc 
cgcaaatcac 
aagccgctgg 
aggatgacct 
ccgcaccagt 
ccgttgatcg 
tacatgactg 
cgctggaata 
tcacgcgccc 
agaatacata 



tggccgtcga 60 

tcttgaacac 120 

■cataagtcct' 180 

cgttgtagat 240 

cgacgcgcag 300 

cgcgcgattg 360 

gcgcgatact 420 

caacggctta 480 

tctgacgcgc 540 

ttgcggattc 600 

ggaagatttg 660 

a 711 



<210> 28 
<211> 468 
<212> DNA. 

<213> Escherichia coli 



<400> 28 

atgagtcagg 

gagagccagt 

gtgacgattc 

aaggataagc 

tcgctactgg 

ggcaaaccac 

ggttacgatc 

atgcttgctc 



tgatcctcga 
ttcagacatg 
gcgtggtcga 
cgaccaacgt 
gcgatctggt 
tggaggcgca 
acatcgaaga 
tgggctatga 



tttacaactg 
gctgaatgcg 
taccgccgaa 
gctctccttc 
tatctgccgt 
ctgggcgcat 
tgacgaagca 
ggatccgtac 



gcatgtgaag 
gtgatcccgc 
agccacagtc 
ccgtttgaag 
caggtggttg 
atggtggtgc 
gaagaaatgg 
attgccgaga 



ataattccgg 
•agtttcagga 
tgaatctgac 
tgccgcctgg 
agaaggaagc 
acggcagtct 
aagccctcga 
aagaataa 



gttaccggaa 60 
agaatcggaa 120 
ctatcgcggt 180 
catggaaatg 240 
tcaggagcaa 300 
gcatttgtta 360 
aacagagatt 4 20 
4 68 



<210>29 
- <211> 1119 

<212> DNA . 
. <213> Escherichia coli 

<400> 29 

atgcataacc aggctccaat 
ccgattggcg, atggtgctcc 
gtcgaagcaa cggtcaatca 
gtatccgtac cgacgatgga 
gtgccgctgg tggctgacat 
ggcgtcgatt gtctgcgtat 



tcaacgt^aga aaatcaacac 
catcgccgta cagtccatga 
aatcaaggcg ctggaacgcg 
cgcggcagaa gcgttcaaac 
ccacttcgac tatcgcattg 
taaccctggc aatatcggta 



gtatttacgt tgggaatgtg 60 
ccaatacgcg tacgacagac 120 
ttggcgctga tatcgtccgt 180 
tcatcaaaca gcaggttaac 240 
cgctgaaagt agcggaatac 300 
atgaagagcg tattcgcatg 360 
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gtggttgact 
ctggaaaaag 
gccatgcgtc 
aaagcgtctg 
cagccgttgc 
gccattggtt 
gcggccgatc 
cgttcgcgag 
atcggtacgg 
tcgattatcg 
accggcggca 
aacaacgata 
gaagcgcgtc 



gtgcgcgcga 
atctgcaaga 
atgttgatca 
acgtcttcct 
atctggggat 
taggtctgct 
cggtcgaaga 
ggatcaactt 
ttaacgcgct 
gctgcgtggt 
acaagaaaag 
tgatcgacca 
gaattgacgt 



taaaaacatt 
aaagtatggc 
tctcgatcgc 
cgctgt.tgag 
caccgaagcc 
gctgtctgaa 
gatcaaagtc 
catcgcctgc 
ggagcaacgc 
gaatggccca 
c^ggcctctat 
gctggaagca 
tcagcaggtt 



ccgatccgta 
gaaccgacgc 
ctgaacttcg 
tcttatcgtt 
ggtggtgcgc 
ggcatcggcg 
ggtttcgata 
ccgacctgtt 
ctggaagata 
ggtgaggcgc 
-gaagatggcg 
cgcattcgtg 
gaaaaataa 



ttggcgttaa 
cgcaggcgtt 
atcagttcaa 
tgctggcaaa 
gcagcggggc 
acacgctgcg 
ttttgaaatc 
cgcgtcagga 
tcatcactcc 
tggtttctac 
tgcgcaaaga 
cgaaagccag 



cgccggatcg 
gctggaatct 
agtca'gcgtg 
acagatcgat 
agtaaaatcc 
cgtatcgctg 
gc.tgcgtatc 
atttgatgtt 
gatggacgtt 
actcggcgtc 
ccgtctggac 
tcagctggac 



420 

480 

540 . 

600 

660" 

720 

780 . 

840 

900 

960 

1020 

1080 

1119 



<210> 30 
<211> 480 
<212> DNA 

<213> Escherichia coli 



<400> 30 
atgcaaaaac 
atcgtgacgc 
agtaaaaaac 
catctgggga 
caacacgcta 
cagctggcgc 
tcgaaagagt 
. gatgtcaccc 



gggcgattta 
gcgccacgca 
cgatgtttac 
acgtggaagt 
cggtgctgat 
atatgaatcg 
ggtcgtttat 
atttcctgcc 



tccgggtact 
gatgttcgat 
cctggaagag 
ggtcgggttt 
tcgtggcctg 
ccacttaatg 
ctcttcatcg 
ggagaatgtc 



ttcgatccca 
cacgttattc 
cgtgtggcac 
agtgatttaa 
cgtgcggtgg 
ccggaactgg 
ttggtgaaag 
catcaggcgc 



ttaccaatgg 
tggcgattgc 
tggcacagca 
tggcgaactt 
cagattttga 
aaagtgtgtt 
aggtggcgcg 
tgatggcgaa 



tcatatcgat .60 
cgccagcccc 120 
ggcaaccgcg 180 
cgcccgtaat 240 
atatgaaatg 300 
tctgatgccg 360 
ccatcagggc . 420 
gttagcgtag 480 



<210> 31 
<211> 699 
<212> DNA 

<213> Escherichia coli 



-<400> 31 
atgaaaatcg 
gaaaaccgtc 
accgaggttg 
ttgctgttgg 
gcaccaacgt 
gatgtcacgg 
gctgacgata 
gtacgtggcc 
.atccgccaca 
gtctgccaca 
caacagtctc 
atggttgagt 



gcatcattgg 
aaactatcag 
cgcttctgaa 
aacactgcaa 
tgaaagtggg 
catttggtta 
aactgatcgc 
tgattgttag 
acttcccaca 
atttcaacgt 
atcttagctt 
cactggtgca 



tgcaatggaa 
tctcggcggt 
atcgggcatc 
gccagatgtg 
cgatatcgtt 
tgaatacggt 
tgccgctgag 
cggcgacgct 
ggccattgct 
cccgtttgtt 
cgatgagttc 
gaaacttgca 



gaagaagtta 
tgcgaaatct 
ggtaaagtcg 
attattaaca 
gtctcggacg 
cagttaccag. 
gcctgcattg 
ttcatcaacg 
gtagagatgg 
gtcgtacgcg 
ctggctgttg 
catggctaa 



cgctgctgcg 
ataccggcca 
ctgcggcgct 
ccggttctgc 
aagcacgtta 
gctgtccggc 
ccgaactgaa 
gttctgttgg 
aagcgacggc 
ccatctccga 
ccgctaaaca 



tgacaaaatc- 60 
actgaatgga 120 
gggtgccact 180 
cggtggcctg ,240 
tcacgacgcg 300 
aggctttaaa" 360 
tcttaacgct 420 
tctggcgaaa 480 
aatcgcccat 540 
cgtggccgat 600 
gtccagcctg 660 
699 



<210> 32 
<211> 1344 
<212> DNA 

<213> Escherichia coli 



<400> 32 

gtgagcaatc tgtcgctcga tttttcggat aatacttttc aacctctggc cgcgcgtatg 60 
cggccagaaa atttagcaca gtatatcggc cagcaacatt tgctggctgc ggggaagccg 120 
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ttgccgcgcg c^atcgaagc cgggcattta cattctatga tcctctgggg gccgccgggt 180 

accggcaaaa caactctcgc tgaagtgatt gcccgctatg cgaacgctga tgtggaacgt 240 

atttctgccg tcacctctgg cgtgaaagag attcgcgagg cgatcgagcg cgcccggcaa 300 

aaccgcaatg caggtcgccg cactattctt tttgttgacg aagttcaccg tttcaacaaa 360 

agccagcagg atgcatttct gccacatatt gaagacggca ccatcacttt tattggcgca 420 *" 

accactgaaa acccgtcgtt tgagcttaat tcggcactgc tttcecgtgc ccgtgtctat 480 ■ 

ctg'ttgaaat ccctgagtac agaggatatt gagcaagtac taactcaggc gatggaagac 540 

aaaacccgtg gctatggtgg tcaggatatt gttctgccag atgaaacacg acgcgccatt 600 

gctgaactgg tgaatggcga cgcgcgccgg gcgttaaata cgctggaaat gatggcggat 660 

atggccgaag tcgatgatag cggtaagcgg gtcctgaa'gc ctgaattact gaccgaaatc 720 

gccggtgaac gtagcgcccg ctttgataac aaaggcgatc gcttttacga tctgatttcc 780 

gcactgcata agtcggtacg tggtagcgca cccgatgcgg cgctgtactg gtatgcgcga 840 

attattaccg ctggtggcga tccgttatat gtcgcgcgtc gctgtctggc gattgcgtct 900 

gaagacgtcg gtaatgccga tccacgggcg atgcaggtgg caattgcggc ctgggattgc 960 

tttactcgcg ttggcccggc ggaaggtgaa cgcgccattg ctcaggcgat tgtttacctg 1020 

gcctgcgcgc caaaaagcaa cgctgtctac actgcgttta aagccgcgct ggccgatgct 1080 

cgcgaacgcc cggattatga cgtgccggtt catttgcgta atgcgccgac gaaattaatg 1140 

aaggaaatgg gctacgggca ggaatatcgt tacgctcatg atgaagcaaa cgcttatgct 1200 
gccggtgagg tttacttccc gccggaaata gcacaaacac gctattattt . cccgacaaac • 1260 

aggggccttg aaggcaagat tggcgaaaag ctcgcctggc tggctgaaca ggatcaaaat 1320 
agccccataa aacgctaccg ttaa 1344 



<210> 33 
<211> 1911 
. <212> DNA 

<213> .Escherichia coli - ■ /. 

. <400> 33 

gtgacggacg attttgcacc agacggtcag ctggcgaaag cgataccagg ctttaagccg 60 
cgagaaccac agcgacagat ggcggtagcc gtcacccagg cgatagaaaa' aggccagccg 120 
ctggtggtgg aagcaggaac cggtacgggc aaaacctacg cttacctggc tcctgcgctg 180 
cgggcgaaaa agaaagtcat tatctcgacc ggctcaaaag cgttgcagga tcagctctac :: 240 
agccgcgatt tgccaacagt ctcaaaggca ttgaaatata c^ggcaacgt ggcgctgctg 300 
aaagggcgct caaactacct ctgcctcgaa cgtctcgaac agcaggcgct ggcggggggc .36P 
gatctgccgg tacaaatctt aagcgatgtg atcctgctgc gctcctggtc- taatcaaaca 420 
gtcgatggtg atatcagcac ctgcgtcagc gtggcggaag attcacaggc gtggccgctg 480 
gtcaccagca ccaacgacaa ctgtcttggc agcgactgcc cgatgtataa agattgcttt 540 
gtggtcaaag cacgtaaaaa agcgatggac gccgatgtgg tggtggtaaa ccatcatctc 600 
tttctggcgg atatggtggt taaagagagt ggatttggcg aactgatccc ggaagcggac 660 
gtcatgatct tcgacgaagc ccaccagcta ccggacattg ccagccagta ttttggtcag 720 
tcactctcca gtcgacaact gctcgacctg gcaaaagaca tcaccatcgc ctaccgcacc 780 
gaattaaaag acacccagca gttacaaaag tgcgctgatc gtcttgccca gagtgcgcag 840 
gattttcgtc tgcaactcgg tgagccaggt tatcgcggta acctgcgtga gctgttagct 900 
aatccgcaaa ttcagcgggc* atttttactg ctcgatgaca ccctggaact ttgttatgac 960 
gtggcgaaac tgtcactggg gcgttccgcc ttgctggatg cggcatttga gcgcgccacg 1020 
ttgtatcgca cacggctgaa gcggctaaaa. gagatcaatc agccgggcta cagctactgg 1080 
tacgaatgca cttcgcgcca ttttactctg gctctcacgc cgctcagcgt ggcggataaa 1140 
. ttcaaagagt taatggcgca aaaacccggt agctggatct tcacctcagc aacgctgtcg 1200 
gtgaacgacg atctgcatca tttcacctcg cggcttggca tcgaacaggc cgagtcgttg 1260 
ctgttgccca gcccatttga ttacagccgc caggcgttac tctgtgtgct gcgcaatctg 1320 
ccgcaaacca accagccagg ttctgctcgc cagttagcgg caatgctgcg accgatcatc 1380 
gaagctaaca acggtcgttg ttttatgctt tgtacctcgc acgccatgat gcgcgatctg 1440 
gccgagcagt tccgcgctac catgacgctt cctgtattgt tgcaggggga aaccagcaaa 1500 
gggcaactgt tgcagcaat't tgtcagcgcc ggtaatgcgc ttcttgtggc aaccagcagt 1560 
ttctgggaag gggtggacgt gcgtggcgat acattgtcat tggtaatt'at cgacaaattg 1620 
ccgtttacct cgccggatga tccactgtta aaagcgcgca tggaagattg tcgtttgcgc 1680 
ggtggcgacc .cgttcgatga agtgcaacta ccagatgccg tcattactct caaacagggg 1740 
gtagggcgac tgattcgcga cgccgacgat cgtggcgtgc tggtgatttg tgacaatcgg 1800 
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ctggtgacgc gtccttacgg cgcgacgttt ctcgccagtc tgccgcccgc gccacgcacc 1860 
cgtgacattg cccgtgcggt tcgtttcctt gcgataccat cctccaggta a 1911 



<210> 34 
<211> 414 
<212> DNA 

<213> Escherichia coli 



<400>' 34 

atggctaata 

gtgacgcaga 

gacggcgtat 

gattccggct 

atcaaagact 

catctggggc 

tctgcctctt" 



aaccttcggc 
atcatgggac 
atcactgttt 
gtggctggcc 
tgtcacatgg 
atgtcttccc 
tacgctttac 



agaagaactg 
agaaccgcca 
gatctgcgat 
cagtttctac 
aatgcagcgc 
cgacgggccg 
cgatggcgaa 



aaaaaaaatt 
tttacgggtc 
gccccgctgt 
gaaccggtaa 
atagaaattc 
cagccaacgg 
aacggcgaag 



tgtccgagat 
gtttactgca 
ttcattccca 
gtgaagaatc 
gttgcggtaa 
gcgaacgtta 
aaatcaacgg 



gcagttttac 60 
taacaagcgt 120 
aaccaagtat 180 
cattcgttat 240 
ctgtgatgcc 3C0 
ttgtgtraac 360 
ttga 414 



<210> 35 
<211> 1968 
<212> DNA 

<213> Escherichia coli 



<400> 35 
atgaccattg 
gcggcggggc 
ggcaacctgt 
ggcaaagagg 
atcgcgctgg 
aaggagatca 
gggcgctcgc 
atcgtgtttc 
acctgcgata 
actattctgg 
gtgcagacca 
ctgggctgtc 
acctcgcagg 
tccgggcagg 
gatagccgcg. 
gtgatccacg 
cacgcggcgg 
ccgcgtctgg 
ctcgcgggcg 
gacgccatga 
gagcgccggg 
gtgatcctgc 
acgggcaaca 
gtgggcgaag 
caggcgatca 
ggcggcgggc 
catatctcgt 
acgggcgcct 
ctgcgcgata 
aacttcatcg 
cggcggcaga 
• tgggcggcac 
-aaaattatcg 



agaaaatttt 

cgcagggcgc. 

tcggcatgac 

tgctgattat 

gctaccacac 

cccgcaatgg 

agggcacgca . 

gccgcctgat 

aagggctgcc 

tgccgggcgg 

tcggcgcgcg 

gcgcctgcgc 

tggtcgcgga 

cggtgtggct 

gcatcaccac 

cggcgttcgg 

gctgcacgat 

tgagcgtgct 

gcgtgccgga 

ccgtgaccgg 

cgcgcttccg 

cgccggagaa 

tcgctccgga 

atggcgtata 

aggcgatcaa 

cgtccggcac 

^ggggcaagac 

gcttcggcca 

acgacatcat 

gcaccgcgga 

cgcacccgga 

tgcagtcggt 

aggtaattaa 



caccccgcag 
tctgccgctg 
gcagaacgcc 
cggcactcag 
cgggcattgg 
cgggatcccg 
cggtatgttc 
ccgctccctg 
cgccaccatg 
ggcgacgctg 
tttcgccaac 
ctcgccgggc 
ggcgctgggt 
ggagatcgcc 
gcgggatatc 
cggctccacc 
cccggacgtt 
gcccaacggc 
ggtgatgctc 
ccagacggtg 
ccagtgcctg 
ggcaaaagcg 
aggttcggtg 
ccaccacacc 
gcgggaagag 
cggcatggaa 
ggtgtcgctc 
cgtgtcgccg 
cgagattgcc 
caacccgctg 
cctgcacgcc 
gagcggcggc 
cgccggtaaa 



gacgacgcgt 
accccgcaga 
gggatgggct 
ggcggcatcc 
gagatcggca 
ttcgcggcct 
gattccctgc 
ccgacgcggc 
attgcgctgg 
ccgccgaccg 
cacgaactct 
ggcgggtgtc 
ctggcgctgc 
cgccagtcgg 
ctctccgata 
aatttactgc 
gagcactgga 
ccggactatc 
cacctgcgcg 
ggcgagaacc 
cgcgagcagg 
aaagggctga 
atcaaggcca 
ggccgggtgc 
attgtgcagg 
gagacctacc 
atcaccgatg 
gaggcigctgg 
gtgga.tcgtc 
acgccggaag 
cacgactttt 
acctggaaag 
aaagcgctcg 



tttatgcggt 
tgctgatgga 
gggacgccaa 
gcgccggaga 
.tgcagatgca 
tcgtcagcga 
cgtaccgcaa 
gggcggtgat 
ccgcgatgca 
tcggggaaga 
ccctgcagga 
.agtt-cctcgg 
cgcactccgc 
cgcgcgcggt 
aagccatcga 
tgcacattcc 
cgcgcatcaa 
acccgaccgt 
acctcggcct 
ttgaatggtg 
acggc^taga 
cctcgacggt 
cggcgatcga 
gggtgtttgt 
gcgatatcat 
agctcacctc 
cgcgcttctc 
cgggcgggcc 
tgacgttaac 
agggcgcgcg 
tgccggacga 
gctgtattta 
gaatttaa 



gatcacccac 
atctcccagc 
caagctcacc 
cggacgccca 
ggcggcggcg 
tccgtgcgac 
cgacgcggcg 
cggcgtagcg 
cgacctgccg 
cgcgggcaag 
ggccgccgaa 
cacggcgggc 
gctggcgccg 
cagcgagctg 
aaacgcgatg 
ggccatcgcc 
ccgtaaagtg 
gcgcgccttc 
gctgcatctg 
gcaggcgtcc 
gccggatgac 
ctgcttcccg 
cccgtcggtg 
ctcggaagcg 
ggtggtgatc 
cgcgctaaag 
gggcgtgtcg 
gattggcaag 
tggcagcgtg 
cgagctggcg 
cacccggctg 
tgacaccgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140~ 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800' 

1860 

1920 

1968 
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<210> 36 

<211> 717 

<212> DNA 

<213> Escherichia coli 



<400> 36 

gtgggacgta 

aaaatttatg 

ccagaattaa 

aagcacgtta 

cagggacgtt 

tcaaatgtta 

atcggagcgg 

gggacagacc 

gtgactgaag 

atcgcggctc 

gctcaatctg 

gcccttgaag 



aatgggccaa 
caaaattcgg 
acacatcttt 
ttgataaagc 
atgaaggctt 
accgtacgat 
caggttctgt 
ctgaccatat 
aagaa'ggtaa 
taaaagcagc 
aagttgagct 
atgacgacga 



tattgttgct 
tgtagaaatc 
aaaattcgtt 
aattgataaa 
tggtcctaat 
tgctaacgtt 
cagctatatg 
ttttgaaatt 
cattgttatt 
tggaatcact 
ttccccagaa 
tgtacaaaaa 



aaaaaaacgg 
tatgctgctg 
attgaacgtg 
gccaaaggcg" 
ggctcaatga 
cgcacaattt 
tttgacaata 
ttacttgaag 
tatactgaac 
gagttctcaa 
gatttagaaa 
gtttatcata 



ctaaagacgg 
ctaaacaagg' 
caaagcaggc 
gcggagatga 
ttatcgccga 
tcaataaaaa 
cgggtgtgat 
ctgaagttga 
ctactgacct 
caacagaatt 
tctttgaagg 
acgtcgcaaa 



tgcaacgtct 
tgaacccgat 
acaagttcca 
aacgttcgtg 
gacattgact 
aggcggcaat 
tgtatttaaa 
tgttcgtgat 
tcataaagga 
agaaatgatt 
gcttgttgat 
tctctaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

717 



<210> 37 

<211> 258 

<212> DNA ■ 

<213> Escherichia coli 

<400> 37 

atggcgccgc cactgtcgcc t'ggctcgcgg gtcctgatag ccctcattcg gg'tctatcaa 60 
cgcctgatta gtccgctact cgggccgcat tgtcgtttca ctccaacctg ttcaagctac 120 
ggaattgagg cattgcgcag gtttggagtg ataaaaggca. gttggttgac ggtgaaacgc 180 
gtattaaaat gccacccttt acaccctggt ggtgacgatc ccgtcccgcc cggaccattt 240 
gataccagag ;aacactaa 258 



<210> 38 

<211> 1023 . 
<212> DNA 

<213>. Escherichia coli 



<400> 38 

atgaaaaaag 

gtgggcgtct 

atatttaccc 

aagatcatca 

cactttaaag 

ttgctggaaa 

agcgattacc 

aagtacgcca 

ttctggccag 

gcgcacaaga 

ccttataaag 

gttgccagtg 

cgcctgcaga 

tctcgtgcag 

ccaggtgcga 

acgccgtatc 

gccagtcata 

taa 



tgttattgat 
ggaaggttcg 
tgaagccagg 
atcgtccacg 
ccgggactta 
gcggtaaaga 
tcaagcaatt 
ccgtagcgca 
acacctggat 
aaatggtgaa 
ataaaaa'cca 
aacgcgataa 
ccgacccgac 
acctggaaac 
tagcgacgcc 
tctattttgt 
acaagtctgt 



aatcttgtta 
ccatcttgcc 
gaccggacgt 
ggtttttcaa 
ccgctttaca 
agcacagttc 
gcgtgaggcc 
ggcacttgaa 
gtataccgcc 
agcggtcgat 
gttggtgacg 
ggttgcctca 
cgtgatttac 
gccgacagcg 
gggggcggat 
ggccgatggt 
gcaggattat 



ttgctggtgg 
gacagcaaat 
ctggcgctcg 
tggctgctgc 
ccgcagatga 
cctctgcgac 
ccgtatatca 
ctggaaaacc 
aataccaccg 
agcgcctggg 
atggcatcaa 
gtatttatca 
gggatgggag 
tataacacct 
tcgctgaagg 
aaaggtggtc 
ctgaaagtgc 



tactgggtat 
tgcttatcaa 
gtgaacagct 
gtatcgaacc 
ccgtgcgcga 
tggtagaagg 
agcatacgct 
cggagtggat 
atgtcgcgtt 
aagggcgtgc 
ttatcgaaaa 
accgtttacg 
agcgttataa 
ataccattac 
ctgctgcgca 
acacgtttaa 
ttaaggaaaa 



cgccgctggt 
agaagagacg 
ttatgccgat 
ggatctttct 
gatgctgaaa 
gatgcgtctg 
gagcgatgat 
tgaaggttgg 
actcaagcga 
ggacggtctg 
agaaaccgcc 
cattggtatg 
tggcaaactt 
cggtctgccg 
tceggcaaaa 
taccaatctt 
aaatgcgcag 



60 

120 

180 

24 0 

300 

360 

420 

480 

540 ■ 

600 

660 

720 

780 

840 

900 

960 

1020 

1023 
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<210> 39 
<211> 873 
<212> DNA 

<213> Escherichia coli 



<4-00> 39 

atgctgcccg 

cgccgcgaag 

accattggcg 

cctcgggaag . 

accaccgaag 

ttcccgattg 

gatctggtga 

gtcgataaac 

gggacagtga 

ctggtgcagg 

aaaaagctgg 

tggcgcgatt 

tccgaggtaa 

gttaagatgg 

acctcgccgg 



actcatcagt 
cggatcgcta 
atcaggtgaa 
ccgaagattt 
atggcgagcg 
gccgcctgga 
ataagatcct 
cgattaccga 
ccaaaaagtg 
ggctgaaccg 
aacgcacgcg 
taaccgacga 
aacctaaagc 
aaaaaacggc 
ggcgtaaaaa 



ccgtttaaat 
tatcgagcaa 
acccggcgac 
ggtacttatc. 
cgataacatt 
taaagactcc 
.gcgtgctggc 
ggagtttatt 
caaagttaaa 
tcagatccgg 
catcatgaac 
tgagttaatc 
gaaggccaaa 
ggaaaaaggc 
gaaggggcgc 



aaatacatca 
ggcaatgtgt 
gttgtgaaag 
gccctga'aca 
gtcgatttcg 
caggggctga 
aatgatcatg 
cgcggcatga 
aaagaagcgc 
cgcatgtgcg 
gttagcttaa 
gacctcttta 
ccgaaaacag 
gg.tcgcccgg 
tga 



gcgaaagcgg 
tccttaatgg 
taaacggtca 
agcccgttgg 
ttaaccacag 
ttttcctcac 
agaaagagta 
gtgcgggggt 
cgtttgtctt 
agcatttcgg 
gcggcattcc 
agctcattga 
cgggcatcaa 
cgtccaacgg 



aatttgctca 
caagcgagcc 
gttgattgaa 
tattgtaagc 
caaacgcgtg 
caatcacggc 
tctggt'gacg 
gccaatcctc 
ccgcattacc 
ctatgaagtg 
gctgggggaa 
aaattcctct 
acgtccagtc 
taagcgtttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

873 



<210> 40' 
<211> 159 
<212> PRT 

<213> Escherichia coli 



<400> 40 

Met Arg He Gly His Gly Phe 
1 5 

Pro He He lie Gly Gly Val 

'20 

Ala His Ser Asp Gly Asp Val 
35 

Leu Gly Ala Ala Ala Leu Gly 
50 - 55 

Asp Pro Ala Phe Lys Gly Ala 
* 65. 70 

Trp Arg Arg lie Gin Ala Lys 
85 



Asp Val His Ala 
10 

Arg He Pro' Tyr 
25 

Ala Leu His Ala 

40 • 

Asp lie Gly Lys 



Asp Ser Arg Glu 
75 

Gly Tyr Thr Leu 
90 



Phe Gly Gly Glu Gly 
15 

Glu Lys. Gly Leu Leu 
. 30 

..Leu Thr Asp Ala Leu 
45 

Leu Phe Pro Asp Thr 
60 

Leu Leu Arg Glu Ala 
80 

Gly Asn Val Asp Val 
95 



Thr He He Ala Gin Ala Pro 
100 

Arg Val Phe He Ala Glu Asp 
115 

Val Lys Ala Thr Thr Thr Glu 
130 — 135 

Gly lie Ala Cys Glu Ala Val 
145 150 



Lys Met Leu Pro 

. 105 

Leu Gly Cys His 
120 . 

Lys Leu Gly Phe 



Ala Leu Leu lie 
155 



His He Pro Gin Met 
110 

Met Asp Asp Val Asn- 
125' 

Thr Gly Arg Gly Glu 
140 

Lys Ala Thr. Lys 



WO 00/61793 



14 



PCT/EP00/03135 



<210> 41 
<2il> 158 
<212> PRT 

<213> Haemophilus influenzae 
<400> 41 

Met lie Arg lie Gly His Gly Phe Asp Val His Ala -Phe Gly Glu Asp 
1 . 5 -10 15 * 

Arg Pro Leu He He Gly Gly Val Glu Val Pro Tyr His Thr Gly Phe 
20 25 , ■ 30 

He Ala' His Ser Asp Gly Asp Val Ala Leu His 'Ala Leu Thr Asp Ala 
35 . 40 45 

He Leu Gly Ala Ala Ala Leu Gly Asp He Gly Lys Leu Phe Pro Asp 
50 . .55 60 

Thr Asp Met Gin Tyr Lys Asn Ala Asp Ser Arg Gly Leu Leu Arg Glu 
65 70 75 - 80 

Ala Phe Arg Gin Val Gin Glu Lys Gly Tyr Lys He Gly Asn Val Asp 
85 , 90 95 

lie Thr He He Ala Gin Ala Pro Lys Met Arg Pro His He Asp Ala 
100 105 . 110 

Met Arg Ala Lys He Ala Glu Asp Leu Gin Cys Asp He Glu Gin Val. 
115 120 . 125 

Asn Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Gin 
130 135 " 140 

Glu Gly lie Ala Cys Glu Ala Val Ala Leu Leu lie Arg Gin 
145 ■' . - 150 155 



<210> 42 
<211> 158 . 
<212> PRT 

<213> Bacillus subtilis 
<400> 42 - 

Met Phe Arg He Gly Gin Gly Phe Asp Val His' Gin Leu Val Glu Gly 
1.5 10 15 

Arg -Pro Leu He He Gly Gly He Glu He Pro Tyr Glu Lys Gly Leu 
20 .25 ^ 30 

Leu Gly His Ser Asp Ala Asp Val Leu- Leu His Thr Val Ala Asp Ala 
35 . 40 45 

Cys Leu Gly Ala Val Gly Glu Gly Asp lie Gly Lys His Phe Pro Asp 
50 55 ' 60 1 

Thr Asp Pro Glu Phe Lys Asp Ala Asp Ser Phe Lys Leu Leu Gin His 
65 70 75 80 
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Val Trp Gly lie Val Lys Gin Lys Gly Tyr Val Leu Gly Asn lie- Asp 
,85 90 95 

Cys Thr lie lie Ala Gin Lys Pro Lys Met Leu Pro Tyr He Glu Asp 
100 105 110 

Met Arg Lys Arg He Ala Glu Gly Leu Glu Ala Asp Val Ser Gin Val 
115 120 125. 

Asn Val Lys Ala Thr Thr Thr Glu Lys Leu Gly Phe Thr Gly Arg Ala 
130 . 135 . . 140 

Glu Gly He Ala Ala Gin Ala Thr Val Leu He Gin Lys Gly 
145 150 155 



<210> 43 

<211> 161 " " 

<212> PRT 

<213> Synechocystis sp. 

^400> 43' ' 
Met Thr Ala Leu Arg .He Gly Asn Gly , Tyr Asp He His Arg Leu Val 
1 5 10 15 

Gly Asp Arg Pro Leu lie Leu Gly Gly Val Thr He Ala His His Leu 
20 25 30 

Gly Leu Asp Gly f His Ser Asp Ala Asp Val Leu Thr His Ala Leu Met 
35 40 45 

Asp Ala Leu Leu Gly Ala : Leu Ser Leu Gly Asp He Gly His Tyr Phe 
50 55 60 . 

Pro Pro Ser Asp Ala Arg Trp Gin Gly Ala Asp Ser Leu Lys Leu Leu 

65 ■ ' .70 75 - 80 

Ala Gin Val His Gin Leu He Leu Glu Arg- Gly Trp Arg He Asn Asn 
85 90. ■ 95 

Leu Asp Asn Val He Val Ala Glu Gin Pro Lys Leu Lys Pro His He 
100 ^ 105 . 110 

Gin Ala Met Lys Glu Asn . Leu Ala^ Lys Val Leu Thr He Asp Pro Asp 
115 120 125 

Leu He Gly He Lys Ala Thr Thr Asn Glu Arg Leu Gly Pro Thr Gly 
130 135 140 

Arg Glu Glu Gly He Ala Ala Tyr Ser Val Ala Leu Leu lie Lys Glu 
145 150 * 155 160 

Gly 



<210> 44 
<211> 399 
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<212> PRT 

<213> Treponema pallidum 
<400> 44 

Met Arg Arg Gly Gly Ala Cys Val Gin Lys Lys' Glu Tyr Leu Pro Leu 

1 5 10 15 . ■ . - 

Thr Ser Arg Gin Pro Gly Val Cys Leu Leu Ser Glu He Leu Val Arg 
20 25 30 

Ala Leu Glu' Ala Arg Ser Phe Phe Leu Val Val Val Thr Val Pro Ala 
35 • ' 40 45 

Gly Glu Val Ala Tyr Ala- Glu Ser Gin Val Ala Cys Asp Ser Arg Leu 
50. 55 * 60 

Ser Ala Phe Pro Ser Arg Thr Arg Pro Val He Leu. Tyr Val Pro Gly 

65 ■ ' 70 '75 . ^ 80 

Ala His Thr Arg Ser Ala Ser Val Arg Ala Gly Leu Asp Ala Met Ala 

85 90 95 . 

Thr His Ala Pro Asp Val Val Leu Val His. Asp Gly Ala Arg Pro Phe 
100 105 HO 

Val Ser Val Ala Leu He His Ser Val Leu Glu Ala Thr Cys Arg Tyr 
115 120 - 125 

Gly Ala Ala Val Pro Val He Glu Ala Thr Asp Thr Pro Lys Gly Val 
130 135 '140 

Ala Ala Asp Gly Ser He Glu Thr His Leu He Arg Ser Arg Val Arg 
145 . 150 155 160 

■ Leu Ala Gin Thr Pro Gin Gly Phe Cys Tyr Ala Ser Leu Cys Ala Ala 
165 170 ' 175 - 

His ' His - Arg Ala Ala Thr Asp Gly Glu Gin .Tyr Thr Asp Asp Ser Glu 
180 185 190 

Leu Tyr Ala "Arg Tyr Gly Gly Thr Val His Val Cys Ala Gly Glu Arg. 
• 195 200 205 

Ser Asn Val Lys He Thr Tyr Pro Glu Asp Leu Glu Gin Arg Ala Ser 
210 215 220 

Glu Pro Ala Leu Thr Arg Gly He Ser Val Leu Pro Cys Thr Glu Glu 
225 230 235 240 

Gly Ala Leu Arg Val Gly Leu Gly Thr Asp Met His Ala Leu Cys Ala 
245 250 .255 

Gly Arg Pro Leu He Leu Ala Gly He His He Pro Ser Lys Lys Gly ■ 
260 265 270 

Ala Gin Gly His Ser -Asp Ala Asp Val Leu Ala His. Ala Ser lie Asp 
275 280 285 



Ala Leu, Leu Gly Ala Ala Gly Leu Gly Asp He Gly Thr Phe Phe Pro 
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290 ' 295 300 

Ser Cys Aso Gly Arg Trp Lys Asp Ala His Ser Cys Ala Leu Leu Arg 
305 310 315 • 320 , 

His Thr Trp Gin LeuValArg Ala Ala Cys Trp Arg Leu Val Asn Leu 
325 . 330 335 

Asp Ala Val Val Cys Leu Glu Gin Pro Ala Leu His Pro Phe Arg Glu 
340 "345 350 

Ala Met Arg Ala Ser Leu Ala Gin Ala Leu Asp Thr His Val Thr Arg 
355 360 '365 

Val Phe Val Lys Ala Lys Thr Ala Glu Arg Leu Gly Pro Val Gly Ser 
370 " 375 .380 

Gly Ala Ala- Val Thr Ala Gin Val Val Val Leu Leu Lys Lys lie 
385 .390 395 



<210> 45 
<211> 406 
<212> PRT 

<213> Helicobacter pylori _ 
<400> 45 

Met Ser Leu He Arg Val Asn Gly Glu Ala Phe Lys Leu Ser Leu Glu 
1 5 10 - 15 

Ser Leu Glu Glu Asp Pro Phe Glu Thr Lys Glu Thr Leu Glu Thr Leu 
20 25 30 

He Lys Gin' Thr Ser Val Val Leu Leu Ala Ala Gly Glu Ser Arg Arg 
- 35 40 45 

Phe Ser Gin Thr He Lys Lys Gin Trp Leu Arg Ser Asn His Thr Pro 
50 55 60 1 

Leu Trp Leu Ser Val Tyr Glu Ser Phe Lys. Glu Ala Leu Asp Phe Lys 
65, 70 75 80 

Glu He He Leu Val Val Ser Glu Leu- Asp Tyr He Tyr He Lys Arg 
, 85 90 . 95 

His Ty- Pro Glu He Lys Leu Val Lys Gly Gly Ala Ser Arg Gin Glu 
100 105- HO 

Ser Val Arg Asn Ala Leu Lys He He Asp Ser Ala Tyr Thr Leu Thr 
115 - 120 125 

Ser Aso Val Ala Arg Gly Leu Ala Asn He Glu Ala Leu Lys Asn Leu 
130 135 - 140 

' Phe Leu Thr Leu Gin Gin Thr Ser His Tyr Cys He Ala Pro Tyr Leu 
145 ^ 150 155. 160 ; 

Pro Cys Tyr Asp Thr Ala He Tyr Tyr Asn Glu Ala Leu Asp Arg Glu 
165 170 175 
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